SURE: Synergistic Uncertainty-aware Reasoning for Multimodal Emotion Recognition in Conversations

📅 2026-04-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Multimodal emotion recognition in conversational settings is often hindered by modality-specific noise and limited contextual reasoning capabilities. To address these challenges, this work proposes SURE, a novel framework that uniquely integrates uncertainty modeling with iterative contextual reasoning. SURE employs an uncertainty-aware mixture-of-experts module to suppress noisy modalities, enhances multi-turn dialogue modeling through an iterative inference mechanism, and introduces a Transformer-based gating module to effectively capture both intra- and inter-modality interactions. Experimental results demonstrate that SURE significantly outperforms state-of-the-art methods across multiple MERC benchmarks, achieving notable improvements in model robustness and fine-grained emotional reasoning.
📝 Abstract
Multimodal emotion recognition in conversations (MERC) requires integrating multimodal signals while being robust to noise and modeling contextual reasoning. Existing approaches often emphasize fusion but overlook uncertainty in noisy features and fine-grained reasoning. We propose SURE (Synergistic Uncertainty-aware REasoning) for MERC, a framework that improves robustness and contextual modeling. SURE consists of three components: an Uncertainty-Aware Mixture-of-Experts module to handle modality-specific noise, an Iterative Reasoning module for multi-turn reasoning over context, and a Transformer Gate module to capture intra- and inter-modal interactions. Experiments on benchmark MERC datasets show that SURE consistently outperforms state-of-the-art methods, demonstrating its effectiveness in robust multimodal reasoning. These results highlight the importance of uncertainty modeling and iterative reasoning in advancing emotion recognition in conversational settings.
Problem

Research questions and friction points this paper is trying to address.

Multimodal Emotion Recognition
Conversational Context
Uncertainty Modeling
Noise Robustness
Contextual Reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uncertainty-aware
Multimodal Emotion Recognition
Iterative Reasoning
Mixture-of-Experts
Transformer Gate
🔎 Similar Papers
No similar papers found.
Y
Yiqiang Cai
Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials & School of Electronic Science and Engineering (School of Microelectronics), South China Normal University
Chengyan Wu
Chengyan Wu
South China Normal University & SUN YAT-SEN University
Machine LearningNLPMultilingual IELLM AlignmentMultimodal
Bolei Ma
Bolei Ma
LMU Munich
LinguisticsNatural Language ProcessingComputational Social Science
B
Bo Chen
Shenzhen University
Y
Yun Xue
Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials & School of Electronic Science and Engineering (School of Microelectronics), South China Normal University
Julia Hirschberg
Julia Hirschberg
Columbia University
Spoken Language ProcessingNatural Language ProcessingProsody
Ziwei Gong
Ziwei Gong
Ph.D. candidate, Columbia University
NLPSpeech