Multimodal Doctor-in-the-Loop: A Clinically-Guided Explainable Framework for Predicting Pathological Response in Non-Small Cell Lung Cancer

📅 2025-05-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Accurately predicting pathological complete response (pCR) following neoadjuvant therapy for non-small-cell lung cancer (NSCLC) remains challenging due to difficulties in effectively fusing imaging and clinical data and the limited interpretability of existing deep learning models. Method: We propose a physician-in-the-loop multimodal deep learning framework. First, we introduce a novel clinical-knowledge-guided “physician-in-the-loop” mechanism, embedding radiologist-annotated lesion priors into model training. Second, we design a mid-level dynamic fusion architecture for imaging and clinical features, integrated with an interpretable attention module to enable progressive lesion focusing. Contribution/Results: Evaluated on a public cohort, our framework significantly improves pCR prediction accuracy (+5.2%). It generates clinically interpretable saliency maps and transparent decision rationales. To our knowledge, this is the first reproducible, verifiable, and clinically deployable multimodal explainable AI paradigm for NSCLC neoadjuvant response assessment.

Technology Category

Application Category

📝 Abstract
This study proposes a novel approach combining Multimodal Deep Learning with intrinsic eXplainable Artificial Intelligence techniques to predict pathological response in non-small cell lung cancer patients undergoing neoadjuvant therapy. Due to the limitations of existing radiomics and unimodal deep learning approaches, we introduce an intermediate fusion strategy that integrates imaging and clinical data, enabling efficient interaction between data modalities. The proposed Multimodal Doctor-in-the-Loop method further enhances clinical relevance by embedding clinicians' domain knowledge directly into the training process, guiding the model's focus gradually from broader lung regions to specific lesions. Results demonstrate improved predictive accuracy and explainability, providing insights into optimal data integration strategies for clinical applications.
Problem

Research questions and friction points this paper is trying to address.

Predicting pathological response in non-small cell lung cancer
Integrating imaging and clinical data via multimodal fusion
Embedding clinician knowledge to enhance model relevance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal Deep Learning with explainable AI
Intermediate fusion of imaging and clinical data
Clinician-guided training for enhanced relevance
Alice Natalina Caragliano
Alice Natalina Caragliano
Università Campus Bio-Medico di Roma
C
C. Tacconi
Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy
C
Carlo Greco
Research Unit of Radiation Oncology, Department of Medicine and Surgery, Universit`a Campus Bio-Medico di Roma, Rome, Italy
L
L. Nibid
Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy
E
E. Ippolito
Research Unit of Radiation Oncology, Department of Medicine and Surgery, Universit`a Campus Bio-Medico di Roma, Rome, Italy
M
Michele Fiore
Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy
G
Giuseppe Perrone
Research Unit of Anatomical Pathology, Department of Medicine and Surgery, Universit`a Campus Bio-Medico di Roma, Rome, Italy
Sara Ramella
Sara Ramella
Associate Professor of Radiation Oncology Campus Bio-Medico University of Rome
Radioterapia Oncologica
P
P. Soda
Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Universit`a Campus Bio-Medico di Roma, Rome, Italy; Department of Diagnostics and Intervention, Radiation Physics, Biomedical Engineering, Ume˚a University, Ume˚a, Sweden
Valerio Guarrasi
Valerio Guarrasi
Università Campus Bio-Medico di Roma, Italy
Artificial IntelligenceMachine LearningMultimodal Deep LearningGenerative AI