JEPA-DNA: Grounding Genomic Foundation Models through Joint-Embedding Predictive Architectures

📅 2026-02-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes JEPA-DNA, a novel genomic foundation model that addresses the limitations of existing approaches relying solely on masked language modeling (MLM) or next-token prediction (NTP), which often fail to capture global functional context and yield biologically fragmented representations. JEPA-DNA introduces the Joint Embedding Predictive Architecture (JEPA) into genomic pretraining for the first time, supervising the CLS token in latent space to predict high-order functional embeddings of masked regions rather than merely reconstructing individual nucleotides. The framework synergistically integrates JEPA with MLM and NTP objectives, enabling either training from scratch or continual enhancement of existing models. Experimental results demonstrate that JEPA-DNA consistently outperforms purely generative baselines across multiple genomic benchmark tasks, achieving superior performance in both supervised and zero-shot settings while producing more robust and biologically meaningful representations.

Technology Category

Application Category

📝 Abstract
Genomic Foundation Models (GFMs) have largely relied on Masked Language Modeling (MLM) or Next Token Prediction (NTP) to learn the language of life. While these paradigms excel at capturing local genomic syntax and fine-grained motif patterns, they often fail to capture the broader functional context, resulting in representations that lack a global biological perspective. We introduce JEPA-DNA, a novel pre-training framework that integrates the Joint-Embedding Predictive Architecture (JEPA) with traditional generative objectives. JEPA-DNA introduces latent grounding by coupling token-level recovery with a predictive objective in the latent space by supervising a CLS token. This forces the model to predict the high-level functional embeddings of masked genomic segments rather than focusing solely on individual nucleotides. JEPA-DNA extends both NTP and MLM paradigms and can be deployed either as a standalone from-scratch objective or as a continual pre-training enhancement for existing GFMs. Our evaluations across a diverse suite of genomic benchmarks demonstrate that JEPA-DNA consistently yields superior performance in supervised and zero-shot tasks compared to generative-only baselines. By providing a more robust and biologically grounded representation, JEPA-DNA offers a scalable path toward foundation models that understand not only the genomic alphabet, but also the underlying functional logic of the sequence.
Problem

Research questions and friction points this paper is trying to address.

Genomic Foundation Models
Masked Language Modeling
Next Token Prediction
functional context
biological grounding
Innovation

Methods, ideas, or system contributions that make the work stand out.

Joint-Embedding Predictive Architecture
Genomic Foundation Models
Latent Grounding
Functional Embedding
Continual Pre-training
🔎 Similar Papers
No similar papers found.
A
Ariel Larey
Applied AI Architecture, NVIDIA, Israel
E
Elay Dahan
Worldwide Field Ops, NVIDIA, Israel
Amit Bleiweiss
Amit Bleiweiss
NVIDIA
Deep LearningComputer Vision
R
Raizy Kellerman
Cancer Research Center and Wohl Institute of Translational Medicine, Sheba Medical Center, Tel Hashomer, Israel
G
Guy Leib
Cancer Research Center and Wohl Institute of Translational Medicine, Sheba Medical Center, Tel Hashomer, Israel
O
Omri Nayshool
Cancer Research Center and Wohl Institute of Translational Medicine, Sheba Medical Center, Tel Hashomer, Israel
Dan Ofer
Dan Ofer
Hebrew University
Machine LearningBioinformaticsNLPProteomicsautoML
T
Tal Zinger
Cancer Research Center and Wohl Institute of Translational Medicine, Sheba Medical Center, Tel Hashomer, Israel
D
Dan Dominissini
Cancer Research Center and Wohl Institute of Translational Medicine, Sheba Medical Center, Tel Hashomer, Israel
G
Gideon Rechavi
Cancer Research Center and Wohl Institute of Translational Medicine, Sheba Medical Center, Tel Hashomer, Israel
N
Nicole Bussola
Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
S
Simon Lee
Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
S
Shane O'Connell
Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
D
Dung Hoang
Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
M
Marissa Wirth
Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
A
Alexander W. Charney
Windreich Department of AI and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
N
Nati Daniel
Applied AI Architecture, NVIDIA, Israel
Yoli Shavit
Yoli Shavit
NVIDIA