An Inclusive Foundation Model for Generalizable Cytogenetics in Precision Oncology

πŸ“… 2025-05-21
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Chromosome abnormality identification is critical for genetic disorder diagnosis and precision oncology, yet existing AI approaches suffer from high annotation costs, incomplete dataset coverage, and poor generalizability. To address these limitations, we propose CHROMAβ€”the first inclusive foundation model designed for comprehensive chromosome abnormality detection across all major types, enabling robust generalization under resource-constrained settings (e.g., few-shot learning and severe class imbalance). CHROMA leverages self-supervised learning, pretrained on 84,000 clinical specimens (~4 million high-resolution karyotype images), and integrates multi-scale feature representation with anomaly-decoupled modeling. Extensive evaluations across diverse chromosome abnormality detection tasks demonstrate consistent and significant superiority over state-of-the-art methods. CHROMA markedly reduces reliance on expert annotations, enhances early detection of rare genomic aberrations, and advances the clinical deployment of scalable, trustworthy AI in cytogenetics.

Technology Category

Application Category

πŸ“ Abstract
Chromosome analysis is vital for diagnosing genetic disorders and guiding cancer therapy decisions through the identification of somatic clonal aberrations. However, developing an AI model are hindered by the overwhelming complexity and diversity of chromosomal abnormalities, requiring extensive annotation efforts, while automated methods remain task-specific and lack generalizability due to the scarcity of comprehensive datasets spanning diverse resource conditions. Here, we introduce CHROMA, a foundation model for cytogenomics, designed to overcome these challenges by learning generalizable representations of chromosomal abnormalities. Pre-trained on over 84,000 specimens (~4 million chromosomal images) via self-supervised learning, CHROMA outperforms other methods across all types of abnormalities, even when trained on fewer labelled data and more imbalanced datasets. By facilitating comprehensive mapping of instability and clonal leisons across various aberration types, CHROMA offers a scalable and generalizable solution for reliable and automated clinical analysis, reducing the annotation workload for experts and advancing precision oncology through the early detection of rare genomic abnormalities, enabling broad clinical AI applications and making advanced genomic analysis more accessible.
Problem

Research questions and friction points this paper is trying to address.

Addressing complexity and diversity of chromosomal abnormalities in AI models
Overcoming task-specific limitations and lack of generalizability in automated methods
Reducing expert annotation workload for reliable clinical genomic analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-supervised learning on 84,000 specimens
Generalizable chromosomal abnormality representations
Scalable solution for clinical genomic analysis
πŸ”Ž Similar Papers
No similar papers found.
Changchun Yang
Changchun Yang
KAUST
medical image analysisimaging science
W
Weiqian Dai
Xinhua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine
Yilan Zhang
Yilan Zhang
King Abdullah University of Science and Technology
Computer VisionMedical Image Analysis
S
Siyuan Chen
Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia, Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia, Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia
J
Jingdong Hu
Smiltec(Suzhou)Co.,Ltd
J
Junkai Su
Smiltec(Suzhou)Co.,Ltd
Y
Yuxuan Chen
Smiltec(Suzhou)Co.,Ltd
A
Ao Xu
Smiltec(Suzhou)Co.,Ltd
N
Na Li
Smiltec(Suzhou)Co.,Ltd
X
Xin Gao
Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia, Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia, Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia
Y
Yongguo Yu
Xinhua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine