One Sequence to Segment Them All: Efficient Data Augmentation for CT and MRI Cross-Domain 3D Spine Segmentation

📅 2026-05-04
📈 Citations: 0
Influential: 0
📄 PDF

career value

173K/year
🤖 AI Summary
This work addresses the limited generalization of models trained on single-sequence data across cross-modality (CT/MRI) and cross-contrast medical images, as well as performance degradation due to scarce annotated data. To this end, the authors propose an efficient data augmentation strategy tailored for cross-modality transfer. Their approach is the first to systematically quantify the impact of data augmentation on cross-modality generalization and introduces a GPU-optimized augmentation pipeline. Using only single-sequence training data, the method substantially improves model robustness for spine segmentation on unseen CT/MRI sequences. Experiments demonstrate an average 155% increase in Dice coefficient across seven out-of-domain datasets, with negligible in-domain performance loss (average drop of merely 0.008%) and approximately 10% higher training efficiency. The accompanying toolbox is open-sourced and compatible with mainstream frameworks such as nnUNet and MONAI.
📝 Abstract
Deep learning-based medical image segmentation is increasingly used to support clinical diagnosis and develop new treatment strategies. However, model performance remains limited by the scarcity of high-quality annotated data and insufficient generalization across imaging protocols. This limitation is particularly evident in MRI and CT, where models are typically trained on a single acquisition sequence and exhibit reduced robustness when applied to unseen sequences or contrasts. Although data augmentation is widely used to improve general robustness on medical images, its impact on cross-modality generalization has not been quantitatively explored. In this work, we study a targeted set of data augmentation techniques designed to improve cross-modality transfer. We train three spine segmentation models, each on a single-modality/sequence dataset, and evaluate them across seven out-of-distribution datasets (spanning CT and MRI), reflecting a realistic single-sequence training and multi-sequence/contrast/modality deployment scenario. Our results demonstrate substantial performance gains on unseen domains (average Dice gain of 155 %) while preserving in-domain accuracy (average Dice decrease of 0.008 %), including effective transfer between CT and MRI. To mitigate the computational cost typically associated with strong data augmentation, we implement GPU-optimized augmentations that maintain, and even improve, training efficiency by approximately 10 %. We release our approach as an open-source toolbox, enabling seamless integration into commonly used frameworks such as nnUNet and MONAI. These augmentations significantly enhance robustness to heterogeneous clinical imaging scenarios without compromising training speed.
Problem

Research questions and friction points this paper is trying to address.

cross-modality
data augmentation
medical image segmentation
domain generalization
spine segmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

cross-modality segmentation
data augmentation
spine segmentation
domain generalization
GPU-accelerated augmentation
🔎 Similar Papers
No similar papers found.
N
Nathan Molinier
NeuroPoly Lab, Institute of Biomedical Engineering, Polytechnique Montreal
H
Hendrik Möller
Department for Interventional and Diagnostic Neuroradiology, TUM University Hospital
T
Thomas Dagonneau
NeuroPoly Lab, Institute of Biomedical Engineering, Polytechnique Montreal
A
Anna Curto-Vilalta
Chair for AI in Healthcare and Medicine, Technical University of Munich (TUM)
R
Robert Graf
Department for Interventional and Diagnostic Neuroradiology, TUM University Hospital
M
Matan Atad
Department for Interventional and Diagnostic Neuroradiology, TUM University Hospital
Daniel Rueckert
Daniel Rueckert
Technical University of Munich and Imperial College London
Machine LearningMedical Image ComputingBiomedical Image AnalysisComputer Vision
J
Jan S. Kirschke
Department for Interventional and Diagnostic Neuroradiology, TUM University Hospital
J
Julien Cohen-Adad
NeuroPoly Lab, Institute of Biomedical Engineering, Polytechnique Montreal