π€ AI Summary
Non-contrast CT (NCCT) suffers from low contrast and poor signal-to-noise ratio, severely impeding early and accurate stroke diagnosis. To address this, we propose the first application of the self-supervised vision Transformer DINOv3 to multi-task stroke analysis, establishing a unified framework that jointly performs infarct/hemorrhage segmentation, abnormality classification, hemorrhage subtype identification, and ASPECTS scoring. Our method leverages DINOv3 to extract robust high-level features and integrates them into a multi-task learning architecture. We comprehensively evaluate the framework across multiple public and private NCCT datasets. Results demonstrate substantial improvements in lesion detection and classification performance on low-quality NCCT scans, establishing a new state-of-the-art baseline. Furthermore, we publicly release our code and benchmark suite to accelerate clinical translation of self-supervised learning in neuroimaging.
π Abstract
Non-contrast computed tomography (NCCT) is essential for rapid stroke diagnosis but is limited by low image contrast and signal to noise ratio. We address this challenge by leveraging DINOv3, a state-of-the-art self-supervised vision transformer, to generate powerful feature representations for a comprehensive set of stroke analysis tasks. Our evaluation encompasses infarct and hemorrhage segmentation, anomaly classification (normal vs. stroke and normal vs. infarct vs. hemorrhage), hemorrhage subtype classification (EDH, SDH, SAH, IPH, IVH), and dichotomized ASPECTS classification (<=6 vs. >6) on multiple public and private datasets. This study establishes strong benchmarks for these tasks and demonstrates the potential of advanced self-supervised models to improve automated stroke diagnosis from NCCT, providing a clear analysis of both the advantages and current constraints of the approach. The code is available at https://github.com/Zzz0251/DINOv3-stroke.