EchoJEPA: A Latent Predictive Foundation Model for Echocardiography

📅 2026-02-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of disentangling anatomical structures from speckle noise and acquisition artifacts in echocardiography by introducing, for the first time in this domain, a latent prediction paradigm. Leveraging 18 million unlabeled images, a large-scale foundation model is developed to learn robust representations of cardiac anatomy through a latent prediction objective. The approach employs a frozen-backbone multi-view probing framework combined with a physics-informed acoustic perturbation evaluation strategy. Remarkably, using only 1% of labeled data, the model achieves 79% accuracy in view classification and improves estimation performance by approximately 20% for left ventricular ejection fraction and 17% for right ventricular systolic pressure. It demonstrates exceptional zero-shot generalization—particularly outperforming fine-tuned baselines on pediatric patients—and exhibits superior robustness to perturbations compared to existing methods.

Technology Category

Application Category

📝 Abstract
Foundation models for echocardiography often struggle to disentangle anatomical signal from the stochastic speckle and acquisition artifacts inherent to ultrasound. We present EchoJEPA, a foundation model trained on 18 million echocardiograms across 300K patients, representing the largest pretraining corpus for this modality to date. By leveraging a latent predictive objective, EchoJEPA learns robust anatomical representations that ignore speckle noise. We validate this using a novel multi-view probing framework with frozen backbones, where EchoJEPA outperforms leading baselines by approximately 20% in left ventricular ejection fraction (LVEF) estimation and 17% in right ventricular systolic pressure (RVSP) estimation. The model also exhibits remarkable sample efficiency, reaching 79% view classification accuracy with only 1% of labeled data versus 42% for the best baseline trained on 100%. Crucially, EchoJEPA demonstrates superior generalization, degrading by only 2% under physics-informed acoustic perturbations compared to 17% for competitors. Most remarkably, its zero-shot performance on pediatric patients surpasses fully fine-tuned baselines, establishing latent prediction as a superior paradigm for robust, generalizable medical AI.
Problem

Research questions and friction points this paper is trying to address.

echocardiography
speckle noise
acquisition artifacts
anatomical signal disentanglement
ultrasound imaging
Innovation

Methods, ideas, or system contributions that make the work stand out.

latent predictive modeling
echocardiography foundation model
speckle-robust representation
zero-shot generalization
sample efficiency
🔎 Similar Papers
No similar papers found.
Alif Munim
Alif Munim
Ryerson University
machine learningdeep learningagentsvideo understanding
A
Adibvafa Fallahpour
University Health Network, Vector Institute, University of Toronto
T
Teodora Szasz
University of Chicago, Philips Health
A
Ahmadreza Attarpour
University Health Network
River Jiang
River Jiang
University of British Columbia
CardiologyMachine Learning
B
Brana Sooriyakanthan
University Health Network
M
Maala Sooriyakanthan
University Health Network
H
Heather Whitney
University of Chicago
J
J. Slivnick
University of Chicago
B
Barry Rubin
University Health Network, University of Toronto
W
Wendy Tsang
University Health Network, University of Toronto
Bo Wang
Bo Wang
Chief AI Scientist,UHN; CIFAR AI Chair, Vector Institute; Assistant Professor, U of Toronto
Machine LearningComputational BiologyComputer Vision