Semantic Data Augmentation Enhanced Invariant Risk Minimization for Medical Image Domain Generalization

📅 2025-02-08
📈 Citations: 0
Influential: 0
📄 PDF

career value

194K/year
🤖 AI Summary
Medical image generalization across multi-center clinical sites is hindered by distribution shifts arising from heterogeneous imaging protocols, scanner hardware, and operator practices—exacerbated by scarce annotated data, impeding deep learning deployment. To address this, we propose a semantic-guided, domain-directed data augmentation framework embedded within the Invariant Risk Minimization (IRM) paradigm, jointly enforcing semantic alignment and distributional discrepancy reduction. Our key contribution is the first introduction of a cross-domain covariance-guided augmentation direction selection mechanism—replacing conventional random augmentation—to enhance IRM’s robustness in medical imaging. The method integrates cross-domain covariance modeling, semantic-aware augmentation, and multi-center representation learning. Evaluated on a multi-center diabetic retinopathy dataset under challenging conditions (few-shot setting: <100 samples per site; high inter-site heterogeneity), our approach achieves a 5.2% absolute accuracy improvement over state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
Deep learning has achieved remarkable success in medical image classification. However, its clinical application is often hindered by data heterogeneity caused by variations in scanner vendors, imaging protocols, and operators. Approaches such as invariant risk minimization (IRM) aim to address this challenge of out-of-distribution generalization. For instance, VIRM improves upon IRM by tackling the issue of insufficient feature support overlap, demonstrating promising potential. Nonetheless, these methods face limitations in medical imaging due to the scarcity of annotated data and the inefficiency of augmentation strategies. To address these issues, we propose a novel domain-oriented direction selector to replace the random augmentation strategy used in VIRM. Our method leverages inter-domain covariance as a guider for augmentation direction, guiding data augmentation towards the target domain. This approach effectively reduces domain discrepancies and enhances generalization performance. Experiments on a multi-center diabetic retinopathy dataset demonstrate that our method outperforms state-of-the-art approaches, particularly under limited data conditions and significant domain heterogeneity.
Problem

Research questions and friction points this paper is trying to address.

Enhance medical image domain generalization
Address data heterogeneity in medical imaging
Improve invariant risk minimization with semantic augmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic Data Augmentation
Domain-Oriented Direction Selector
Inter-Domain Covariance Guider
🔎 Similar Papers
Y
Yaoyao Zhu
Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, 610213, China; University of Chinese Academy of Sciences, Beijing, 101408, China
Xiuding Cai
Xiuding Cai
University of Chinese Academy of Sciences
Computer VisonMachine Learning
Y
Yingkai Wang
Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, 610213, China; University of Chinese Academy of Sciences, Beijing, 101408, China
Y
Yu Yao
Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, 610213, China; University of Chinese Academy of Sciences, Beijing, 101408, China
Xu Luo
Xu Luo
UESTC
Machine LearningRobotics
Z
Zhongliang Fu
Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, 610213, China; University of Chinese Academy of Sciences, Beijing, 101408, China