SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data

πŸ“… 2025-04-11
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the scarcity of expert annotations, poor cross-modal generalization, and suboptimal zero-shot segmentation performance of existing foundation models (e.g., SAM) in medical image segmentation, this paper proposes SynthFMβ€”the first modality-agnostic synthetic medical image generation framework capable of training general-purpose segmentation foundation models without any real clinical data. Methodologically, SynthFM freezes the SAM encoder and introduces a novel anatomy-aware decoder from scratch; it jointly integrates anatomical priors and physics-informed imaging simulation to generate high-fidelity synthetic CT, MRI, and ultrasound images. Extensive zero-shot evaluation across nine real-world datasets and eleven anatomical structures demonstrates that SynthFM achieves an average 8.2% Dice improvement over SAM and MedSAM, significantly enhancing out-of-distribution generalization. This work marks the first successful realization of multi-modal medical image segmentation using foundation models trained exclusively on synthetic data.

Technology Category

Application Category

πŸ“ Abstract
Foundation models like the Segment Anything Model (SAM) excel in zero-shot segmentation for natural images but struggle with medical image segmentation due to differences in texture, contrast, and noise. Annotating medical images is costly and requires domain expertise, limiting large-scale annotated data availability. To address this, we propose SynthFM, a synthetic data generation framework that mimics the complexities of medical images, enabling foundation models to adapt without real medical data. Using SAM's pretrained encoder and training the decoder from scratch on SynthFM's dataset, we evaluated our method on 11 anatomical structures across 9 datasets (CT, MRI, and Ultrasound). SynthFM outperformed zero-shot baselines like SAM and MedSAM, achieving superior results under different prompt settings and on out-of-distribution datasets.
Problem

Research questions and friction points this paper is trying to address.

Addressing medical image segmentation challenges with synthetic data
Overcoming lack of annotated medical data for foundation models
Enhancing zero-shot segmentation performance across diverse medical modalities
Innovation

Methods, ideas, or system contributions that make the work stand out.

Synthetic data mimics medical image complexities
Pretrained encoder with decoder trained on SynthFM
Outperforms zero-shot baselines across diverse datasets
πŸ”Ž Similar Papers
No similar papers found.
Sourya Sengupta
Sourya Sengupta
University of Illinois Urbana Champaign
Medical ImagingDeep Learning
S
Satrajit Chakrabarty
GE HealthCare, San Ramon, CA, USA
K
K. Ravi
GE HealthCare, San Ramon, CA, USA
G
Gopal Avinash
GE HealthCare, San Ramon, CA, USA
R
Ravi Soni
GE HealthCare, San Ramon, CA, USA