Robust Multi-Omics Integration from Incomplete Modalities Significantly Improves Prediction of Alzheimer's Disease

📅 2025-09-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Incomplete multi-omics data—characterized by frequent modality-wise missingness—severely hinder accurate modeling and biological interpretation of Alzheimer’s disease (AD). To address this, we propose MOIRA, a robust integrative framework for incomplete multi-omics data. MOIRA first aligns heterogeneous modalities into a shared latent embedding space via representation alignment; then employs a learnable-weight early-fusion mechanism for adaptive feature aggregation; and finally incorporates feature importance analysis to identify disease-relevant biomarkers. Evaluated on the ROSMAP cohort, MOIRA significantly outperforms state-of-the-art methods (p < 0.01) in both AD staging and cognitive decline prediction. Biologically, it successfully recapitulates established AD risk genes—including APOE and TREM2—as well as synaptic function pathways, demonstrating strong interpretability and clinical translational potential.

Technology Category

Application Category

📝 Abstract
Multi-omics data capture complex biomolecular interactions and provide insights into metabolism and disease. However, missing modalities hinder integrative analysis across heterogeneous omics. To address this, we present MOIRA (Multi-Omics Integration with Robustness to Absent modalities), an early integration method enabling robust learning from incomplete omics data via representation alignment and adaptive aggregation. MOIRA leverages all samples, including those with missing modalities, by projecting each omics dataset onto a shared embedding space where a learnable weighting mechanism fuses them. Evaluated on the Religious Order Study and Memory and Aging Project (ROSMAP) dataset for Alzheimer's Disease (AD), MOIRA outperformed existing approaches, and further ablation studies confirmed modality-wise contributions. Feature importance analysis revealed AD-related biomarkers consistent with prior literature, highlighting the biological relevance of our approach.
Problem

Research questions and friction points this paper is trying to address.

Integrating multi-omics data with missing modalities for disease prediction
Addressing incomplete omics data hindering robust integrative analysis
Improving Alzheimer's Disease prediction using incomplete multi-omics datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Robust multi-omics integration with incomplete data
Uses representation alignment and adaptive aggregation
Projects data onto shared embedding space for fusion
S
Sungjoon Park
LG AI Research, Seoul, Republic of Korea
K
Kyungwook Lee
LG AI Research, Seoul, Republic of Korea
Soorin Yim
Soorin Yim
AI Scientist at LG AI Research
BioinformaticsDeep learning
Doyeong Hwang
Doyeong Hwang
LG AI Research, Seoul, Republic of Korea
D
Dongyun Kim
LG AI Research, Seoul, Republic of Korea; Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Soonyoung Lee
Soonyoung Lee
LG AI Research
Computer VisionMachine Learning
Amy Dunn
Amy Dunn
The Jackson Laboratory
Neuroscience
D
Daniel Gatti
The Jackson Laboratory, Bar Harbor, Maine, USA
E
Elissa Chesler
The Jackson Laboratory, Bar Harbor, Maine, USA
Kristen O'Connell
Kristen O'Connell
Associate Professor, The Jackson Laboratory
Neurosciencesystems neuroscienceenergy balanceneurogenetics
Kiyoung Kim
Kiyoung Kim
LG AI Research, Seoul, Republic of Korea