Extending Sequence Length is Not All You Need: Effective Integration of Multimodal Signals for Gene Expression Prediction

📅 2026-02-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing methods for gene expression prediction often rely excessively on extending DNA sequence length while neglecting the effective integration of proximal multimodal epigenomic signals, thereby introducing spurious associations. To address this limitation, this work proposes Prism, a novel framework that systematically models the distinct biological roles of diverse epigenomic signals. Prism characterizes background chromatin states through high-dimensional feature combinations to differentiate functional regulatory signals from nonspecific background noise and incorporates backdoor adjustment—a causal inference technique—to mitigate confounding effects. Remarkably, using only short DNA sequences, Prism achieves state-of-the-art performance in gene expression prediction, demonstrating that precise integration of multimodal epigenomic signals is superior to merely increasing sequence length.

Technology Category

Application Category

📝 Abstract
Gene expression prediction, which predicts mRNA expression levels from DNA sequences, presents significant challenges. Previous works often focus on extending input sequence length to locate distal enhancers, which may influence target genes from hundreds of kilobases away. Our work first reveals that for current models, long sequence modeling can decrease performance. Even carefully designed algorithms only mitigate the performance degradation caused by long sequences. Instead, we find that proximal multimodal epigenomic signals near target genes prove more essential. Hence we focus on how to better integrate these signals, which has been overlooked. We find that different signal types serve distinct biological roles, with some directly marking active regulatory elements while others reflect background chromatin patterns that may introduce confounding effects. Simple concatenation may lead models to develop spurious associations with these background patterns. To address this challenge, we propose Prism, a framework that learns multiple combinations of high-dimensional epigenomic features to represent distinct background chromatin states and uses backdoor adjustment to mitigate confounding effects. Our experimental results demonstrate that proper modeling of multimodal epigenomic signals achieves state-of-the-art performance using only short sequences for gene expression prediction.
Problem

Research questions and friction points this paper is trying to address.

gene expression prediction
multimodal epigenomic signals
confounding effects
background chromatin patterns
spurious associations
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal integration
gene expression prediction
epigenomic signals
confounding mitigation
backdoor adjustment
🔎 Similar Papers
No similar papers found.
Zhao Yang
Zhao Yang
Renmin University of China
Machine LearningAI4ScienceLanguage ModelsDiffusion Models
Y
Yi Duan
Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China; Beijing Key Laboratory of Research on Large Models and Intelligent Governance; Engineering Research Center of Next-Generation Intelligent Search and Recommendation, MOE
J
Jiwei Zhu
Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China; Beijing Key Laboratory of Research on Large Models and Intelligent Governance; Engineering Research Center of Next-Generation Intelligent Search and Recommendation, MOE
Y
Ying Ba
Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China; Beijing Key Laboratory of Research on Large Models and Intelligent Governance; Engineering Research Center of Next-Generation Intelligent Search and Recommendation, MOE
Chuan Cao
Chuan Cao
Microsoft AI4Science
Computational BiologyVirologyGenomics
Bing Su
Bing Su
Renmin University of China
Computer VisionMachine LearningPattern Recognition