🤖 AI Summary
In cancer survival prediction, the pervasive incompleteness of multimodal data—such as genomics, medical imaging, and clinical records—severely degrades model performance and hinders mechanistic interpretability. To address this, we propose impuTMAE, an end-to-end Transformer-based model. Its core innovation lies in jointly modeling cross-modal interactions and performing missing modality imputation during masked autoencoding pretraining, enabling robust joint learning over heterogeneous, incomplete multimodal inputs. impuTMAE uniformly integrates DNA methylation, RNA-seq, MRI, whole-slide images (WSI), and clinical variables. Evaluated on TCGA-GBM/LGG and BraTS datasets, it achieves state-of-the-art performance in glioma survival prediction, with markedly improved robustness and generalizability across diverse missingness patterns. Moreover, its architecture facilitates post-hoc interpretability, offering a novel paradigm for explainable prognostic modeling under multimodal data incompleteness.
📝 Abstract
The use of diverse modalities, such as omics, medical images, and clinical data can not only improve the performance of prognostic models but also deepen an understanding of disease mechanisms and facilitate the development of novel treatment approaches. However, medical data are complex, often incomplete, and contains missing modalities, making effective handling its crucial for training multimodal models. We introduce impuTMAE, a novel transformer-based end-to-end approach with an efficient multimodal pre-training strategy. It learns inter- and intra-modal interactions while simultaneously imputing missing modalities by reconstructing masked patches. Our model is pre-trained on heterogeneous, incomplete data and fine-tuned for glioma survival prediction using TCGA-GBM/LGG and BraTS datasets, integrating five modalities: genetic (DNAm, RNA-seq), imaging (MRI, WSI), and clinical data. By addressing missing data during pre-training and enabling efficient resource utilization, impuTMAE surpasses prior multimodal approaches, achieving state-of-the-art performance in glioma patient survival prediction. Our code is available at https://github.com/maryjis/mtcp