Bridging the gap between Performance and Interpretability: An Explainable Disentangled Multimodal Framework for Cancer Survival Prediction

📅 2026-03-02
📈 Citations: 0
Influential: 0
📄 PDF

career value

188K/year
🤖 AI Summary
Multimodal cancer survival prediction often struggles to balance predictive performance with model interpretability. To address this challenge, this work proposes the DIMAFx framework, which disentangles whole-slide histopathology images and transcriptomic data to learn both modality-specific and shared representations. By doing so, DIMAFx achieves state-of-the-art predictive accuracy while enhancing model transparency. Notably, it is the first approach to unify high performance with high interpretability in multimodal survival analysis. Integrated with SHAP-based feature attribution, the framework reveals biologically meaningful interactions between modalities. Validation across multiple cancer cohorts demonstrates that DIMAFx successfully identifies shared risk features in breast cancer associated with high-grade morphology and estrogen response pathways, as well as modality-specific signals linked to the tumor microenvironment.

Technology Category

Application Category

📝 Abstract
While multimodal survival prediction models are increasingly more accurate, their complexity often reduces interpretability, limiting insight into how different data sources influence predictions. To address this, we introduce DIMAFx, an explainable multimodal framework for cancer survival prediction that produces disentangled, interpretable modality-specific and modality-shared representations from histopathology whole-slide images and transcriptomics data. Across multiple cancer cohorts, DIMAFx achieves state-of-the-art performance and improved representation disentanglement. Leveraging its interpretable design and SHapley Additive exPlanations, DIMAFx systematically reveals key multimodal interactions and the biological information encoded in the disentangled representations. In breast cancer survival prediction, the most predictive features contain modality-shared information, including one capturing solid tumor morphology contextualized primarily by late estrogen response, where higher-grade morphology aligned with pathway upregulation and increased risk, consistent with known breast cancer biology. Key modality-specific features capture microenvironmental signals from interacting adipose and stromal morphologies. These results show that multimodal models can overcome the traditional trade-off between performance and explainability, supporting their application in precision medicine.
Problem

Research questions and friction points this paper is trying to address.

multimodal
survival prediction
interpretability
cancer
disentangled representation
Innovation

Methods, ideas, or system contributions that make the work stand out.

disentangled representation
explainable AI
multimodal learning
cancer survival prediction
SHAP
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
A
Aniek Eijpe
AI Technology for Life, Department of Information and Computing Sciences, Department of Biology, Utrecht University, Utrecht, The Netherlands
S
Soufyan Lakbir
AI Technology for Life, Department of Information and Computing Sciences, Department of Biology, Utrecht University, Utrecht, The Netherlands; Department of Metabolic Diseases, Wilhelmina Children’s Hospital, University Medical Center Utrecht, Utrecht, The Netherlands; Regenerative Medicine Center Utrecht, Utrecht, The Netherlands
M
Melis Erdal Cesur
Computational Pathology, Department of Pathology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
Sara P. Oliveira
Sara P. Oliveira
Postdoctoral researcher | Computational Pathology group | The Netherlands Cancer Institute
Computational PathologyMedical Image AnalysisDeep LearningComputer Vision
Angelos Chatzimparmpas
Angelos Chatzimparmpas
Assistant Professor, Utrecht University
Information VisualizationVisual AnalyticsExplainable MLTrustworthy MLExplainable AI
Sanne Abeln
Sanne Abeln
Professor of AI Technology for Life, Utrecht University
AI for the Life SciencesProtein BioinformaticsGenomic AlterationsNeurodegenerative Disease.
Wilson Silva
Wilson Silva
Assistant Professor, AI Technology for Life, Utrecht University
Machine LearningComputer VisionExplainable AIMedical Image AnalysisPrivacy