Debiasing Cardiac Imaging with Controlled Latent Diffusion Models

📅 2024-03-28
🏛️ arXiv.org
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
Cardiac MRI data exhibit severe distributional bias and class imbalance due to sensitive attributes—including sex, age, BMI, and health status—hindering fair and robust model training. Method: This work introduces, for the first time, a ControlNet-augmented latent diffusion model (LDM) for medical image debiasing. It enables fine-grained, sensitive-attribute-controllable synthesis by jointly conditioning on patient metadata and geometry-aware encodings derived from cardiac segmentation masks. The entire pipeline is trained end-to-end on a single consumer-grade GPU. Contribution/Results: Synthesized data substantially improve coverage of underrepresented subgroups—particularly young patients and heart failure patients with normal BMI. Downstream classifiers achieve a 12.7% absolute accuracy gain on minority groups and reduce inter-group performance disparity by 38%. With an FID of 14.2, generated images meet clinical usability standards, demonstrating both fairness enhancement and practical deployability.

Technology Category

Application Category

📝 Abstract
The progress in deep learning solutions for disease diagnosis and prognosis based on cardiac magnetic resonance imaging is hindered by highly imbalanced and biased training data. To address this issue, we propose a method to alleviate imbalances inherent in datasets through the generation of synthetic data based on sensitive attributes such as sex, age, body mass index, and health condition. We adopt ControlNet based on a denoising diffusion probabilistic model to condition on text assembled from patient metadata and cardiac geometry derived from segmentation masks using a large-cohort study, specifically, the UK Biobank. We assess our method by evaluating the realism of the generated images using established quantitative metrics. Furthermore, we conduct a downstream classification task aimed at debiasing a classifier by rectifying imbalances within underrepresented groups through synthetically generated samples. Our experiments demonstrate the effectiveness of the proposed approach in mitigating dataset imbalances, such as the scarcity of younger patients or individuals with normal BMI level suffering from heart failure. This work represents a major step towards the adoption of synthetic data for the development of fair and generalizable models for medical classification tasks. Notably, we conduct all our experiments using a single, consumer-level GPU to highlight the feasibility of our approach within resource-constrained environments. Our code is available at https://github.com/faildeny/debiasing-cardiac-mri.
Problem

Research questions and friction points this paper is trying to address.

Addressing imbalanced cardiac MRI datasets using synthetic data generation
Mitigating biases in medical imaging with text-conditioned diffusion models
Improving classifier fairness through debiasing underrepresented patient groups
Innovation

Methods, ideas, or system contributions that make the work stand out.

Text-conditioned diffusion models for synthetic data
ControlNet with patient metadata and geometry conditioning
Debiasing classifiers via synthetic sample augmentation
🔎 Similar Papers
No similar papers found.
Grzegorz Skorupko
Grzegorz Skorupko
Universitat de Barcelona
medical imagingcomputer visionmachine learning
Richard Osuala
Richard Osuala
University of Barcelona
Medical Image AnalysisGenerative ModelsComputer VisionDeep Learning
Zuzanna Szafranowska
Zuzanna Szafranowska
Barcelona Artificial Intelligence in Medicine Lab (BCN-AIM), Facultat de Matemàtiques i Informàtica, Universitat de Barcelona, Spain
Kaisar Kushibar
Kaisar Kushibar
University of Barcelona
Medical Image AnalysisDeep LearningComputer Vision
N
N. Aung
William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University London, Charterhouse Square, London, UK
S
Steffen E. Petersen
William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University London, Charterhouse Square, London, UK
K
K. Lekadir
Barcelona Artificial Intelligence in Medicine Lab (BCN-AIM), Facultat de Matemàtiques i Informàtica, Universitat de Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluís Companys 23, Barcelona, Spain
P
P. Gkontra
Barcelona Artificial Intelligence in Medicine Lab (BCN-AIM), Facultat de Matemàtiques i Informàtica, Universitat de Barcelona, Spain