ConceptM$^3$oE: Concept-Guided Multimodal Mixture of Experts for Interpretable Computational Pathology

📅 2026-05-23
📈 Citations: 0
Influential: 0
🤖 AI Summary
Current computational pathology models struggle to integrate multimodal diagnostic information and to justify their decisions in interpretable terms, particularly when morphological features alone are insufficient to distinguish complex tumor subtypes. This work proposes a concept-guided multimodal mixture-of-experts (MoE) architecture that embeds structured diagnostic concepts directly into the MoE pathways. By decomposing multimodal evidence through modality-specific, redundant, and synergistic experts, and by preserving the original information via residual connections, the model performs on par with unconstrained baselines on an institutional pediatric brain tumor cohort and a public glioma cohort. In few-shot settings it raises macro-F1 by more than 10 percentage points (from 56.41% to 66.70%), converges faster, and generates reasoning traces that an independent neuropathologist validated.
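For readers unfamiliar with the metric behind the few-shot figure: macro-F1 is the unweighted mean of per-class F1 scores, so a rare tumor subtype counts as much as a common one. The sketch below is a minimal illustration, not the authors' evaluation code. The function name `macro_f1` and the toy labels are hypothetical; only the two reported scores (56.41% and 66.70%) come from the abstract.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all classes seen in either list."""
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); define it as 0 when the class never appears.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels, purely illustrative (not data from the paper).
y_true = ["A", "A", "A", "A", "B", "B"]
y_pred = ["A", "A", "A", "B", "B", "A"]
toy_score = macro_f1(y_true, y_pred)  # F1(A) = 0.75, F1(B) = 0.5

# Macro-F1 values reported in the abstract, in percent.
baseline, concept = 56.41, 66.70
absolute_gain = concept - baseline               # percentage points
relative_gain = absolute_gain / baseline * 100   # percent of the baseline
```

With the reported numbers, the gain is about 10.29 percentage points in absolute terms, or roughly 18% relative to the baseline.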
📝 Abstract
Healthcare models are transitioning from unimodal prediction toward multimodal reasoning over heterogeneous diagnostic inputs. In computational pathology, for complex tumor subtypes where morphology alone can be challenging to distinguish, pathology reports and molecular measurements may provide additional diagnostic evidence alongside whole-slide images, yet existing models often fail to clarify how diverse signals assemble into recognizable diagnostic concepts. We propose ConceptM$^3$oE (Concept Multimodal MoE), which embeds concept formation directly within interaction-aware mixture-of-experts (MoE) pathways. The architecture decomposes evidence into modality-specific, redundant, and synergistic experts, which are then projected into structured concept bottlenecks mapping latent features to a hierarchy of morphology and biomarker concepts. To prevent the information loss typical of interpretable bottlenecks, we utilize residual pathways within each expert to allow task-relevant signals to flow both through the concepts and directly to the final task prediction, so that high performance is maintained alongside interpretability. Across an institutional pediatric brain tumor cohort and a public glioma cohort, the framework delivers performance competitive with unconstrained models while producing reasoning traces validated by an independent neuropathologist. In data-limited regimes, ConceptM$^3$oE increases macro-F1 from 56.41% to 66.70% at small training sizes compared to non-concept-informed baselines, while also showing faster training convergence consistent with the regularizing effect of concept learning. This work offers a scalable path toward high-performance medical AI that is inherently verifiable and better aligned with the complex decision-making of clinical practice.
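The data flow described in the abstract (experts, a concept bottleneck inside each expert, and a residual path around it) can be sketched as a forward pass. This is a rough illustration under stated assumptions, not the authors' implementation. All names (`ConceptExpert`, `forward`), dimensions, the softmax gate, and the random weights are hypothetical. The concept layer is flat here rather than a morphology/biomarker hierarchy, and the training objectives that would make an expert "redundant" or "synergistic" are omitted.

```python
import numpy as np

rng = np.random.default_rng(0)


def linear(in_dim, out_dim):
    """Random weights standing in for a trained layer (assumption)."""
    return rng.normal(scale=0.1, size=(in_dim, out_dim))


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(x):
    z = np.exp(x - x.max(axis=-1, keepdims=True))
    return z / z.sum(axis=-1, keepdims=True)


class ConceptExpert:
    """Latent features -> concept scores -> class logits, plus a residual
    path from the latent features directly to the logits."""

    def __init__(self, in_dim, hidden, n_concepts, n_classes):
        self.w_in = linear(in_dim, hidden)
        self.w_concept = linear(hidden, n_concepts)
        self.w_task = linear(n_concepts, n_classes)
        self.w_residual = linear(hidden, n_classes)

    def __call__(self, x):
        h = np.tanh(x @ self.w_in)
        concepts = sigmoid(h @ self.w_concept)  # interpretable bottleneck
        # Prediction uses the concepts AND the residual (bypass) signal.
        logits = concepts @ self.w_task + h @ self.w_residual
        return concepts, logits


def forward(experts, gate_w, inputs):
    """Mix expert logits with a softmax gate over the fused input."""
    fused = np.concatenate(list(inputs.values()), axis=-1)
    weights = softmax(fused @ gate_w)  # (batch, n_experts)
    concepts, logits = {}, []
    for name, expert in experts.items():
        # Modality-specific experts see one modality; the others see all.
        x = inputs[name] if name in inputs else fused
        concepts[name], expert_logits = expert(x)
        logits.append(expert_logits)
    stacked = np.stack(logits, axis=1)  # (batch, n_experts, n_classes)
    mixed = (weights[..., None] * stacked).sum(axis=1)
    return mixed, concepts, weights


# Hypothetical embedding sizes for the three modalities named in the abstract.
dims = {"wsi": 16, "report": 8, "molecular": 4}
fused_dim = sum(dims.values())
n_concepts, n_classes, batch = 5, 3, 2

experts = {m: ConceptExpert(d, 12, n_concepts, n_classes) for m, d in dims.items()}
experts["redundant"] = ConceptExpert(fused_dim, 12, n_concepts, n_classes)
experts["synergistic"] = ConceptExpert(fused_dim, 12, n_concepts, n_classes)
gate_w = linear(fused_dim, len(experts))

inputs = {m: rng.normal(size=(batch, d)) for m, d in dims.items()}
logits, concepts, weights = forward(experts, gate_w, inputs)
```

In this sketch, `concepts[name]` holds each expert's per-sample concept scores, which is what a reviewer would inspect, while `weights` shows how much each expert contributed to the final prediction. Dropping the `h @ self.w_residual` term would give a strict bottleneck, where every bit of task signal must pass through the concepts.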
Problem

Research questions and friction points this paper is trying to address.

computational pathology
multimodal reasoning
interpretable AI
diagnostic concepts
tumor subtypes
Innovation

Methods, ideas, or system contributions that make the work stand out.

Concept Bottleneck
Multimodal Mixture of Experts
Interpretable AI
Computational Pathology
Residual Concept Pathways
Xuan Wang
The University of Texas Rio Grande Valley
big data analytics, causal inference, advanced methodologies

Zhongling Xu
University of Texas at Austin, Austin, TX, USA

Gopi Kannedhara
University of Texas at Austin, Austin, TX, USA

Joakim Nguyen
University of Texas at Austin, Austin, TX, USA

Jian Yu
Auckland University of Technology
graph neural networks, recommender systems, deep learning, complex networks, Internet computing

Jinrui Fang
University of Texas at Austin, Austin, TX, USA

Abdurrahmaan Baghdadi
University of Texas at Austin, Austin, TX, USA

Tianlong Chen
Assistant Professor, CS@UNC Chapel Hill; Chief AI Scientist, hireEZ
Machine Learning, AI4Science, Computer Vision, Sparsity

Awais Naeem
University of Texas at Austin, Austin, TX, USA

Chandra Krishnan
Dell Children’s Medical Center, Austin, TX, USA

Edward Castillo
University of Texas at Austin, Austin, TX, USA

Andrew H. Song
Postdoctoral fellow, Harvard Medical School
Computational pathology, Statistical signal processing

Ankita Shukla
University of Nevada Reno
Deep Learning, Geometric Methods, Computer Vision, AI for Wildlife Conservation, AI for Science

Ying Ding
Bill & Lewis Suit Professor, School of Information, Dell Med, University of Texas at Austin
AI in Health, Knowledge Graph, Science of Science

Nicholas Konz
Ph.D. Candidate, Duke University
Deep Learning, Medical Image Analysis, Generative Models, Image Translation, Domain Adaptation

Hairong Wang
University of Texas at Austin, Austin, TX, USA