3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography

πŸ“… 2025-02-04
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This paper addresses the challenges of poor generalization and scarcity of high-quality annotations in multi-disease detection from cranial CT images. To this end, we propose FM-CTβ€”the first 3D self-supervised foundation model specifically designed for cranial CT. Leveraging 362,000 unlabeled, non-contrast-enhanced 3D CT volumes, FM-CT introduces a novel 3D structured self-supervision paradigm that jointly integrates self-distillation-based discriminative learning and masked voxel modeling. The resulting model achieves significantly improved in-distribution (ID) and out-of-distribution (OOD) generalization, particularly excelling in few-shot downstream tasks. Evaluated on an internal dataset and three multi-center external cohorts, FM-CT consistently outperforms both scratch-trained models and existing 3D CT foundation models across diverse diagnostic tasks. It thereby establishes a new state-of-the-art benchmark for 3D AI-driven analysis of cranial CT.

Technology Category

Application Category

πŸ“ Abstract
Head computed tomography (CT) imaging is a widely-used imaging modality with multitudes of medical indications, particularly in assessing pathology of the brain, skull, and cerebrovascular system. It is commonly the first-line imaging in neurologic emergencies given its rapidity of image acquisition, safety, cost, and ubiquity. Deep learning models may facilitate detection of a wide range of diseases. However, the scarcity of high-quality labels and annotations, particularly among less common conditions, significantly hinders the development of powerful models. To address this challenge, we introduce FM-CT: a Foundation Model for Head CT for generalizable disease detection, trained using self-supervised learning. Our approach pre-trains a deep learning model on a large, diverse dataset of 361,663 non-contrast 3D head CT scans without the need for manual annotations, enabling the model to learn robust, generalizable features. To investigate the potential of self-supervised learning in head CT, we employed both discrimination with self-distillation and masked image modeling, and we construct our model in 3D rather than at the slice level (2D) to exploit the structure of head CT scans more comprehensively and efficiently. The model's downstream classification performance is evaluated using internal and three external datasets, encompassing both in-distribution (ID) and out-of-distribution (OOD) data. Our results demonstrate that the self-supervised foundation model significantly improves performance on downstream diagnostic tasks compared to models trained from scratch and previous 3D CT foundation models on scarce annotated datasets. This work highlights the effectiveness of self-supervised learning in medical imaging and sets a new benchmark for head CT image analysis in 3D, enabling broader use of artificial intelligence for head CT-based diagnosis.
Problem

Research questions and friction points this paper is trying to address.

Develops 3D AI for head CT disease detection
Addresses scarcity of high-quality medical annotations
Uses self-supervised learning for robust feature extraction
Innovation

Methods, ideas, or system contributions that make the work stand out.

3D self-supervised learning
Foundation Model for Head CT
Masked image modeling
Weicheng Zhu
Weicheng Zhu
Center for Data Science, New York University
Machine learning
Haoxu Huang
Haoxu Huang
New York University, Center for Data Science, New York, NY, 10001, USA
H
Huanze Tang
New York University, Center for Data Science, New York, NY, 10001, USA
Rushabh Musthyala
Rushabh Musthyala
NYU Courant Institute of Mathematical Sciences
Applied AIComputer Vision
Boyang Yu
Boyang Yu
New York University
probabilistic machine learningfoundation models
L
Long Chen
New York University, Center for Data Science, New York, NY, 10001, USA
E
Emilio Vega
NYU Grossman School of Medicine, Department of Radiology, New York, NY, 10016, USA
Thomas O'Donnell
Thomas O'Donnell
Senior Key Expert, Siemens Healthineers
Computed TomographyMedical ImagingArtificial Intelligence
S
S. Dehkharghani
NYU Grossman School of Medicine, Department of Radiology, New York, NY, 10016, USA
Jennifer A. Frontera
Jennifer A. Frontera
NYU Grossman School of Medicine, Department of Neurology, New York, NY, 10016, USA
A
Arjun V. Masurkar
NYU Grossman School of Medicine, Department of Neurology, New York, NY, 10016, USA; NYU Grossman School of Medicine, Department of Neuroscience and Physiology, New York, NY, 10016, USA; NYU Grossman School of Medicine, Neuroscience Institute, New York, NY, 10016, USA
K
Kara R. Melmed
NYU Grossman School of Medicine, Department of Neurology, New York, NY, 10016, USA
N
N. Razavian
NYU Grossman School of Medicine, Department of Radiology, New York, NY, 10016, USA; NYU Grossman School of Medicine, Department of Population Health, New York, NY, 10016, USA