Building a General SimCLR Self-Supervised Foundation Model Across Neurological Diseases to Advance 3D Brain MRI Diagnoses

📅 2025-09-12

📈 Citations: 0

✨ Influential: 0

career value

239K/year

🤖 AI Summary

Existing self-supervised 3D brain MRI models suffer from limited task generalizability, insufficient spatial resolution, and poor clinical accessibility. Method: We propose the first high-resolution 3D SimCLR foundation model tailored for multiple neurological disorders. Trained on 18,759 unlabeled, multi-center structural MRI scans, it incorporates novel contrastive learning augmentations specifically designed for 3D medical imaging. Contribution/Results: On four downstream tasks—including Alzheimer’s disease prediction—the model achieves superior performance using only 20% of labeled data, significantly outperforming both MAE and supervised baselines. It demonstrates exceptional in-distribution and out-of-distribution generalization, as well as high data efficiency. To foster reproducibility and clinical translation, we fully open-source the trained models, training code, and preprocessed datasets—advancing standardization and real-world deployment of AI in neuroimaging.

Technology Category

Application Category

📝 Abstract

3D structural Magnetic Resonance Imaging (MRI) brain scans are commonly acquired in clinical settings to monitor a wide range of neurological conditions, including neurodegenerative disorders and stroke. While deep learning models have shown promising results analyzing 3D MRI across a number of brain imaging tasks, most are highly tailored for specific tasks with limited labeled data, and are not able to generalize across tasks and/or populations. The development of self-supervised learning (SSL) has enabled the creation of large medical foundation models that leverage diverse, unlabeled datasets ranging from healthy to diseased data, showing significant success in 2D medical imaging applications. However, even the very few foundation models for 3D brain MRI that have been developed remain limited in resolution, scope, or accessibility. In this work, we present a general, high-resolution SimCLR-based SSL foundation model for 3D brain structural MRI, pre-trained on 18,759 patients (44,958 scans) from 11 publicly available datasets spanning diverse neurological diseases. We compare our model to Masked Autoencoders (MAE), as well as two supervised baselines, on four diverse downstream prediction tasks in both in-distribution and out-of-distribution settings. Our fine-tuned SimCLR model outperforms all other models across all tasks. Notably, our model still achieves superior performance when fine-tuned using only 20% of labeled training samples for predicting Alzheimer's disease. We use publicly available code and data, and release our trained model at https://github.com/emilykaczmarek/3D-Neuro-SimCLR, contributing a broadly applicable and accessible foundation model for clinical brain MRI analysis.

Problem

Research questions and friction points this paper is trying to address.

Develops a self-supervised foundation model for 3D brain MRI analysis

Addresses limited generalization across neurological diseases and populations

Overcomes resolution and accessibility limitations in existing 3D MRI models

Innovation

Methods, ideas, or system contributions that make the work stand out.

SimCLR-based SSL foundation model

High-resolution 3D brain MRI analysis

Trained on diverse neurological disease datasets

🔎 Similar Papers

BrainWave: A Brain Signal Foundation Model for Clinical Applications