CoGR-MoE: Concept-Guided Expert Routing with Consistent Selection and Flexible Reasoning for Visual Question Answering

📅 2026-04-18
📈 Citations: 0
Influential: 0
📄 PDF

career value

189K/year
🤖 AI Summary
This work addresses the challenge in visual question answering where expert routing often suffers from either excessive stability or instability, leading to inconsistent reasoning for similar questions or insufficient flexibility. To mitigate this, the authors propose a concept-guided expert routing framework that leverages semantic information from answer choices during training to guide expert selection. Specifically, option features are re-weighted to generate discriminative representations that facilitate both option-wise comparison and contrastive learning. By integrating mixture-of-experts (MoE) architecture, semantic-guided routing, and contrastive optimization, the method enhances reasoning flexibility while preserving routing consistency. Extensive experiments demonstrate that the proposed framework significantly outperforms existing approaches across multiple visual question answering benchmarks, confirming its effectiveness and strong generalization capability.

Technology Category

Application Category

📝 Abstract
Visual Question Answering (VQA) requires models to identify the correct answer options based on both visual and textual evidence. Recent Mixture-of-Experts (MoE) methods improve option reasoning by grouping similar concepts or routing based on examples. However, unstable routing can lead to inconsistent expert selection in the same question type, while overly stable routing may reduce flexibility. To address this, we propose Concept-Guided Routing framework (CoGR-MoE), which incorporates semantics of the answer options to guide expert selection in the training phase. Next, option features are used to reweight the selected experts, producing discriminative representations for each candidate option. These option-level representations are further used for option comparison and optimized via contrastive learning. The experimental results indicate that CoGR-MoE delivers strong performance across multiple VQA tasks, demonstrating the effectiveness of our approach.
Problem

Research questions and friction points this paper is trying to address.

Visual Question Answering
Mixture-of-Experts
Expert Routing
Consistency
Flexibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

Concept-Guided Routing
Mixture-of-Experts
Visual Question Answering
Contrastive Learning
Expert Selection
🔎 Similar Papers
No similar papers found.