Predicting Targeted Therapy Resistance in Non-Small Cell Lung Cancer Using Multimodal Machine Learning

📅 2025-03-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Late-stage EGFR-mutant non-small cell lung cancer (NSCLC) patients lack reliable predictive tools for osimertinib resistance. Method: We propose the first interpretable multimodal machine learning framework integrating histopathological images, NGS-based genomic variants, clinical phenotypes, and demographic data. Our approach combines CNN and Transformer-based feature extraction, cross-modal alignment, survival modeling, and SHAP-based interpretability analysis. Results: Evaluated on a multicenter cohort, the model achieves a concordance index (c-index) of 0.82—significantly outperforming unimodal baselines (c-index: 0.75–0.77)—demonstrating robustness and clinical utility of heterogeneous data integration. Contribution: This work introduces the first interpretable multimodal paradigm specifically designed for tyrosine kinase inhibitor (TKI) resistance prediction, uniquely balancing high predictive accuracy with clinical traceability, thereby enabling more informed, individualized targeted therapy decisions.

Technology Category

Application Category

📝 Abstract
Lung cancer is the primary cause of cancer death globally, with non-small cell lung cancer (NSCLC) emerging as its most prevalent subtype. Among NSCLC patients, approximately 32.3% have mutations in the epidermal growth factor receptor (EGFR) gene. Osimertinib, a third-generation EGFR-tyrosine kinase inhibitor (TKI), has demonstrated remarkable efficacy in the treatment of NSCLC patients with activating and T790M resistance EGFR mutations. Despite its established efficacy, drug resistance poses a significant challenge for patients to fully benefit from osimertinib. The absence of a standard tool to accurately predict TKI resistance, including that of osimertinib, remains a critical obstacle. To bridge this gap, in this study, we developed an interpretable multimodal machine learning model designed to predict patient resistance to osimertinib among late-stage NSCLC patients with activating EGFR mutations, achieving a c-index of 0.82 on a multi-institutional dataset. This machine learning model harnesses readily available data routinely collected during patient visits and medical assessments to facilitate precision lung cancer management and informed treatment decisions. By integrating various data types such as histology images, next generation sequencing (NGS) data, demographics data, and clinical records, our multimodal model can generate well-informed recommendations. Our experiment results also demonstrated the superior performance of the multimodal model over single modality models (c-index 0.82 compared with 0.75 and 0.77), thus underscoring the benefit of combining multiple modalities in patient outcome prediction.
Problem

Research questions and friction points this paper is trying to address.

Predicting osimertinib resistance in NSCLC patients
Lack of standard tools for TKI resistance prediction
Multimodal ML model integrates diverse data for precision
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal machine learning predicts resistance
Integrates histology, NGS, clinical data
Achieves high c-index of 0.82
🔎 Similar Papers
No similar papers found.
P
Peiying Hua
Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
A
Andrea Olofson
Department of Pathology, Ochsner Health System, New Orleans, LA 70115, USA
F
Faraz Farhadi
Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
L
Liesbeth Hondelink
Department of Pathology, Leiden University Medical Center, Leiden, the Netherlands
G
Gregory Tsongalis
Department of Pathology and Laboratory Medicine, Dartmouth -Hitchcock Medical Center, Lebanon, NH 03756, USA
K
Konstantin Dragnev
Department of Medical Oncology, Dartmouth -Hitchcock Medical Center, Lebanon, NH 03756, USA
D
Dagmar Hoegemann Savellano
Department of Radiology, Dartmouth -Hitchcock Medical Center, Lebanon, NH 03756, USA
A
Arief Suriawinata
Department of Pathology and Laboratory Medicine, Dartmouth -Hitchcock Medical Center, Lebanon, NH 03756, USA
Laura Tafe
Laura Tafe
Dartmouth-Hitchcock Medical Center
PathologyLung CancerGynecologic CancersMolecular DiagnosticsHead and Neck Cancer
Saeed Hassanpour
Saeed Hassanpour
Professor at Dartmouth
Biomedical InformaticsMachine LearningAI for HealthcareMedical Imaging