Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning

📅 2026-03-16
📈 Citations: 0
Influential: 0
📄 PDF
📝 Abstract
Imbalanced data distribution remains a critical challenge in sequential learning, leading models to easily recognize frequent categories while failing to detect minority classes adequately. The Mixture-of-Experts model offers a scalable solution, yet its application is often hindered by parameter inefficiency, poor expert specialization, and difficulty in resolving prediction conflicts. To Master the Minority classes effectively, we propose the Uncertainty-based Multi-Expert fusion network (UME) framework. UME is designed with three core innovations: First, we employ Ensemble LoRA for parameter-efficient modeling, significantly reducing the trainable parameter count. Second, we introduce Sequential Specialization guided by Dempster-Shafer Theory (DST), which ensures effective specialization on the challenging-tailed classes. Finally, an Uncertainty-Guided Fusion mechanism uses DST's certainty measures to dynamically weigh expert opinions, resolving conflicts by prioritizing the most confident expert for reliable final predictions. Extensive experiments across four public hierarchical text classification datasets demonstrate that UME achieves state-of-the-art performance. We achieve a performance gain of up to 17.97\% over the best baseline on individual categories, while reducing trainable parameters by up to 10.32\%. The findings highlight that uncertainty-guided expert coordination is a principled strategy for addressing challenging-tailed sequence learning. Our code is available at https://github.com/CQUPTWZX/Multi-experts.
🔎 Similar Papers
No similar papers found.
Y
Ye Wang
Key Laboratory of Cyberspace Big Data Intelligent Security, Ministry of Education; School of Artificial Intelligence, Chongqing University of Post and Telecommunications, Chongqing 400065, China
Zixuan Wu
Zixuan Wu
Georgia Institute of Technology
Robotics
Lifeng Shen
Lifeng Shen
Associate Professor of CQUPT
Sequence ModelingGenerative ModelingRepresentation LearningTime Series Modeling
J
Jiang Xie
Key Laboratory of Cyberspace Big Data Intelligent Security, Ministry of Education; School of Artificial Intelligence, Chongqing University of Post and Telecommunications, Chongqing 400065, China
X
Xiaoling Wang
School of Computer Science and Technology, East China Normal University, Shanghai, 200062, China
Hong Yu
Hong Yu
Chongqing University of Posts & Telecommunications
rough setsknowledge automationthree-way clusteringthree-way decisionsweb intelligence
Guoyin Wang
Guoyin Wang
Chongqing University of Posts & Telecommunications
Artificial Intelligencerough setsdata miningknowledge technology