đ¤ AI Summary
Existing 3D denoising diffusion probabilistic models (DDPMs) for brain MRI typically require preprocessingâsuch as skull-stripping, spatial registration, and latent-space compressionâlimiting anatomical fidelity and generalizability to real-world imaging variability.
Method: We introduce the first 3D DDPM trained directly on raw, unprocessed, unregistered T1-weighted brain MRI scansâincluding inherent anatomical diversity and magnetic field inhomogeneitiesâwithout latent-space compression. We benchmark three DDPM variantsâscore-based, velocity-based, and flow-basedâon a large-scale dataset of 82,000 scans from BraTS and ADNI.
Contribution/Results: Quantitative evaluation (FID, segmentation volume distributions) and qualitative analysis demonstrate that all models generate anatomically coherent 3D brain volumes. Velocity- and flow-based models achieve significantly lower FID (â12.4%) and better match ground-truth thalamic and lentiform nucleus volume distributions (KL divergence reduced by 0.18). All model weights and inference code are publicly released.
đ Abstract
The purpose of this study is to present and compare three denoising diffusion probabilistic models (DDPMs) that generate 3D $T_1$-weighted MRI human brain images. Three DDPMs were trained using 80,675 image volumes from 42,406 subjects spanning 38 publicly available brain MRI datasets. These images had approximately 1 mm isotropic resolution and were manually inspected by three human experts to exclude those with poor quality, field-of-view issues, and excessive pathology. The images were minimally preprocessed to preserve the visual variability of the data. Furthermore, to enable the DDPMs to produce images with natural orientation variations and inhomogeneity, the images were neither registered to a common coordinate system nor bias field corrected. Evaluations included segmentation, Frechet Inception Distance (FID), and qualitative inspection. Regarding results, all three DDPMs generated coherent MR brain volumes. The velocity and flow prediction models achieved lower FIDs than the sample prediction model. However, all three models had higher FIDs compared to real images across multiple cohorts. In a permutation experiment, the generated brain regional volume distributions differed statistically from real data. However, the velocity and flow prediction models had fewer statistically different volume distributions in the thalamus and putamen. In conclusion this work presents and releases the first 3D non-latent diffusion model for brain data without skullstripping or registration. Despite the negative results in statistical testing, the presented DDPMs are capable of generating high-resolution 3D $T_1$-weighted brain images. All model weights and corresponding inference code are publicly available at https://github.com/piksl-research/medforj .