Disentangling representations of retinal images with generative models

📅 2024-02-29

🏛️ Medical Image Analysis

📈 Citations: 3

✨ Influential: 0

career value

204K/year

🤖 AI Summary

In fundus image analysis, technical confounders—such as camera model—induce spurious correlations that impair AI model generalizability. To address this, we propose the first generative demographic disentanglement framework for fundus images, introducing a novel distance correlation–based disentanglement loss to rigorously separate pathological semantics from device-specific effects in the latent space. Built upon a GAN architecture, our method enables independent and controllable generation of disease status and camera type. Extensive evaluation across multiple real-world datasets demonstrates significant improvement in latent representation disentanglement and effective suppression of device-induced bias. Generated images achieve high fidelity, with Fréchet Inception Distance (FID) ≤ 12.3. Our implementation is publicly available.

Technology Category

Application Category

📝 Abstract

Retinal fundus images play a crucial role in the early detection of eye diseases. However, the impact of technical factors on these images can pose challenges for reliable AI applications in ophthalmology. For example, large fundus cohorts are often confounded by factors like camera type, bearing the risk of learning shortcuts rather than the causal relationships behind the image generation process. Here, we introduce a population model for retinal fundus images that effectively disentangles patient attributes from camera effects, enabling controllable and highly realistic image generation. To achieve this, we propose a disentanglement loss based on distance correlation. Through qualitative and quantitative analyses, we show that our models encode desired information in disentangled subspaces and enable controllable image generation based on the learned subspaces, demonstrating the effectiveness of our disentanglement loss. The project's code is publicly available: https://github.com/berenslab/disentangling-retinal-images.

Problem

Research questions and friction points this paper is trying to address.

Disentangle patient attributes from camera effects in retinal images

Prevent AI from learning shortcuts due to technical factors

Enable controllable and realistic retinal image generation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Disentangles patient attributes from camera effects

Uses disentanglement loss based on distance correlation

Enables controllable realistic retinal image generation

🔎 Similar Papers

No similar papers found.