A Foundational Generative Model for Breast Ultrasound Image Analysis

📅 2025-01-12
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address data scarcity, privacy constraints, and insufficient early-diagnostic accuracy in breast ultrasound image analysis, this paper introduces BUSGen, the first foundational generative model tailored to this domain. Trained on over 3.5 million de-identified ultrasound images via large-scale self-supervised pretraining and conditional diffusion modeling, BUSGen integrates anatomical-pathological joint representation learning and few-shot prompt-based fine-tuning to generate high-fidelity, task-specific synthetic data. It enables privacy-preserving data sharing, and the generated data proved as effective as collected real-world data for training diagnostic models. In early breast cancer diagnosis, the approach outperformed all nine board-certified radiologists, with an average sensitivity improvement of 16.5% (P < 0.0001), and it improved the generalization ability of downstream models. An online demo of BUSGen is publicly available.

📝 Abstract
Foundational models have emerged as powerful tools for addressing various tasks in clinical settings. However, their potential in breast ultrasound analysis remains untapped. In this paper, we present BUSGen, the first foundational generative model specifically designed for breast ultrasound image analysis. Pretrained on over 3.5 million breast ultrasound images, BUSGen has acquired extensive knowledge of breast structures, pathological features, and clinical variations. With few-shot adaptation, BUSGen can generate repositories of realistic and informative task-specific data, facilitating the development of models for a wide range of downstream tasks. Extensive experiments highlight BUSGen's exceptional adaptability, significantly exceeding real-data-trained foundational models in breast cancer screening, diagnosis, and prognosis. In early breast cancer diagnosis, our approach outperformed all board-certified radiologists (n=9), achieving an average sensitivity improvement of 16.5% (P < 0.0001). Additionally, we characterized the scaling effect of generated data, which was as effective as collected real-world data for training diagnostic models. Moreover, our approach improved the generalization ability of downstream models. Importantly, BUSGen protected patient privacy by enabling fully de-identified data sharing, a step forward in secure medical data utilization. An online demo of BUSGen is available at https://aibus.bio.
Problem

Research questions and friction points this paper is trying to address.

Breast Ultrasound Analysis
Cancer Detection Accuracy
Data Security and Privacy Protection
Innovation

Methods, ideas, or system contributions that make the work stand out.

BUSGen
Breast Ultrasound Analysis
Privacy Protection
Authors
Haojun Yu (Peking University)
Youcheng Li (Peking University; Computer Vision)
Nan Zhang (Peking University Cancer Hospital & Institute)
Zihan Niu (Peking Union Medical College Hospital)
Xuantong Gong (Cancer Hospital, Chinese Academy of Medical Sciences)
Yanwen Luo (Peking Union Medical College Hospital)
Haotian Ye (Computer Science Ph.D. at Stanford University)
Siyu He (Stanford University)
Quanlin Wu (Peking University)
Wangyan Qin (Peking University Cancer Hospital & Institute)
Mengyuan Zhou (Peking Union Medical College Hospital)
Jie Han (Cancer Hospital, Chinese Academy of Medical Sciences)
Jia Tao (Peking Union Medical College Hospital)
Ziwei Zhao (Yizhun Medical AI Co., Ltd.)
Di Dai (Peking University)
Di He (Peking University)
Dong Wang (Yizhun Medical AI Co., Ltd.)
Binghui Tang (Nanchang People’s Hospital)
Ling Huo (Peking University Cancer Hospital & Institute)
James Zou (Stanford University; machine learning, computational biology, computational health, statistics, biotech)
Qingli Zhu (Peking Union Medical College Hospital)
Yong Wang (Cancer Hospital, Chinese Academy of Medical Sciences; The First Affiliated Hospital of China Medical University)
Liwei Wang (Peking University)