MER 2026: From Discriminative Emotion Recognition to Generative Emotion Understanding

📅 2026-04-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

205K/year
🤖 AI Summary
Traditional emotion recognition approaches are often confined to discrete categorical labels, limiting their capacity to capture the nuanced affective states inherent in human interactions. This work proposes a generative emotion understanding framework that systematically integrates dyadic interaction modeling, fine-grained semantic descriptions, human preference learning, and physiological signals. Built upon multimodal large language models (MLLMs), the framework synergistically combines multimodal data fusion, natural language generation, and preference-aware learning. The study establishes four challenge tracks and introduces a large-scale benchmark platform and dataset, thereby advancing affective computing toward a new paradigm characterized by generative capabilities, multimodal integration, interpretability, and human-centered evaluation.

Technology Category

Application Category

📝 Abstract
MER2026 marks the fourth edition of the MER series of challenges. The MER series provides valuable data resources to the research community and offers tasks centered on recent research trends, establishing itself as one of the largest challenges in the field. Throughout its history, the focus of MER has shifted from discriminative emotion recognition to generative emotion understanding. Specifically, MER2023 concentrated on discriminative emotion recognition, restricting the emotion recognition scope to fixed basic labels. In MER2024 and MER2025, we transitioned to generative emotion understanding and introduced two new tasks: fine-grained emotion recognition and descriptive emotion analysis, aiming to leverage the extensive vocabulary and multimodal understanding capabilities of Multimodal Large Language Models (MLLMs) to facilitate fine-grained and explainable emotion recognition. Building on this trajectory, MER2026 continues to follow these research trends and contains four tracks: MER-Cross shifts the focus from individual to dyadic interaction scenarios; MER-FG centers on fine-grained emotion recognition; MER-Prefer aims to predict human preferences regarding different emotion descriptions; MER-PS focuses on emotion recognition based on physiological signals. More details regarding the dataset and baselines are available at https://zeroqiaoba.github.io/MER-Challenge.
Problem

Research questions and friction points this paper is trying to address.

emotion recognition
generative emotion understanding
fine-grained emotion
multimodal learning
human preference
Innovation

Methods, ideas, or system contributions that make the work stand out.

generative emotion understanding
multimodal large language models
fine-grained emotion recognition
dyadic interaction
physiological signals
Zheng Lian
Zheng Lian
Associate Professor, IEEE/CCF Senior Member, Institute of Automation, Chinese Academy of Sciences
Affective ComputingSentiment AnalysisMachine Learning
Xiaojiang Peng
Xiaojiang Peng
Shenzhen Technology University
Computer VisionFacial Expression RecognitionMultimodal Emotion Recognition
K
Kele Xu
National University of Defense Technology
Z
Ziyu Jia
Institute of Automation, CAS
X
Xinyi Che
Sichuan University
Zebang Cheng
Zebang Cheng
Shenzhen University
AICVMLLMAffective Computing
F
Fei Ma
Guangdong Lab of Artificial Intelligence and Digital Economy (SZ)
Laizhong Cui
Laizhong Cui
Shenzhen University
NetworkingEdge ComputingIoTBig Data,Machine Learning
Yazhou Zhang
Yazhou Zhang
Associate Professor, Tianjin University
Sentiment AnalysisQuantum CognitionSarcasm DetectionHumor Analysis
Xin Liu
Xin Liu
ShanghaiTech University
stochastic systemsonline learning
Liang Yang
Liang Yang
Dalian University of Technology
NLP
Jia Li
Jia Li
Hefei University of Technology <== USTC
Affective ComputingComputer VisionMultimodal Learning
Fan Zhang
Fan Zhang
CSE PhD Student, The Chinese University of Hong Kong (CUHK)
Large Language ModelsAI for ScienceMultimodal Learning
Erik Cambria
Erik Cambria
Professor @ NTU CCDS & Visiting @ MIT Media Lab
Neurosymbolic AIMultimodal InteractionNLPAffective ComputingSentiment Analysis
Guoying Zhao
Guoying Zhao
Academy Professor, IEEE Fellow, Professor of Computer Science and Engineering, University of Oulu
Affective ComputingArtificial IntelligenceComputer VisionPattern Recognition
B
Björn W. Schuller
Technical University of Munich
J
Jianhua Tao
Tsinghua University