Revisiting Invariant Learning for Out-of-Domain Generalization on Multi-Site Mammogram Datasets

📅 2025-03-09
📈 Citations: 0
Influential: 0
🤖 AI Summary
This study addresses the out-of-distribution (OOD) generalization challenge in multi-center mammography analysis. It evaluates two invariant learning methods, Invariant Risk Minimization (IRM) and Variance Risk Extrapolation (VREx), on real-world, publicly available multi-site mammography datasets for breast cancer risk estimation, which the authors describe as the first study of its kind in this area. IRM and VREx are applied to whole-image classification and benchmarked against standard Empirical Risk Minimization (ERM) using accuracy, average precision, and area under the curve (AUC); interpretability is examined through class activation maps (CAM) and visualization of learned representations. Results demonstrate that invariant learning significantly improves cross-site AUC and average precision, effectively mitigating spurious correlations; however, performance remains limited at sites with small sample sizes. This work establishes the first empirical OOD generalization benchmark for multi-site mammography, validating the clinical applicability of invariant learning while clarifying its mechanistic advantages and practical limitations.

📝 Abstract
Despite significant progress in robust deep learning techniques for mammogram breast cancer classification, their reliability in real-world clinical development settings remains uncertain. The translation of these models to clinical practice faces challenges due to variations in medical centers, imaging protocols, and patient populations. To enhance their robustness, invariant learning methods have been proposed, prioritizing causal factors over misleading features. However, their effectiveness in clinical development and impact on mammogram classification require investigation. This paper reassesses the application of invariant learning for breast cancer risk estimation based on mammograms. Utilizing diverse multi-site public datasets, it represents the first study in this area. The objective is to evaluate invariant learning's benefits in developing robust models. Invariant learning methods, including Invariant Risk Minimization and Variance Risk Extrapolation, are compared quantitatively against Empirical Risk Minimization. Evaluation metrics include accuracy, average precision, and area under the curve. Additionally, interpretability is examined through class activation maps and visualization of learned representations. This research examines the advantages, limitations, and challenges of invariant learning for mammogram classification, guiding future studies to develop generalized methods for breast cancer prediction on whole mammograms in out-of-domain scenarios.
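As a rough illustration of how the three training objectives compared in the abstract differ, the sketch below computes them from per-site logits in plain Python. It assumes the IRMv1 form of the IRM penalty (the squared gradient of a site's risk with respect to a dummy scale fixed at 1) and the V-REx form of Variance Risk Extrapolation (the variance of per-site risks). The function names, the `lam` weight, and the toy inputs are illustrative assumptions, not the paper's implementation.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce_risk(logits, labels, scale=1.0):
    """Mean binary cross-entropy of scale * logits against 0/1 labels."""
    total = 0.0
    for z, y in zip(logits, labels):
        p = sigmoid(scale * z)
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(logits)

def irm_penalty(logits, labels):
    """IRMv1-style penalty (assumed form): squared d(risk)/d(scale) at scale = 1.

    For binary cross-entropy this gradient is mean((sigmoid(z) - y) * z).
    """
    grad = sum((sigmoid(z) - y) * z for z, y in zip(logits, labels)) / len(logits)
    return grad ** 2

def objectives(site_batches, lam=1.0):
    """Return ERM, IRM-style, and V-REx-style objectives.

    site_batches: list of (logits, labels) pairs, one pair per site (hypothetical layout).
    """
    # ERM ignores site membership and pools every sample.
    pooled_logits = [z for lg, _ in site_batches for z in lg]
    pooled_labels = [y for _, lb in site_batches for y in lb]
    erm = bce_risk(pooled_logits, pooled_labels)

    # Invariant methods start from per-site risks.
    risks = [bce_risk(lg, lb) for lg, lb in site_batches]
    mean_risk = sum(risks) / len(risks)
    variance = sum((r - mean_risk) ** 2 for r in risks) / len(risks)
    penalty = sum(irm_penalty(lg, lb) for lg, lb in site_batches) / len(site_batches)

    return {
        "erm": erm,
        "irm": mean_risk + lam * penalty,    # penalizes site-specific optimal scaling
        "vrex": mean_risk + lam * variance,  # penalizes unequal risk across sites
    }

# Toy logits for two hypothetical sites.
site_a = ([2.0, -1.0, 0.5], [1, 0, 1])
site_b = ([0.3, -0.2, 1.5], [0, 1, 1])
print(objectives([site_a, site_b], lam=1.0))
```

In a real training loop the logits would come from the mammogram classifier and the penalties would be differentiated by an autograd framework; the closed-form gradient here only holds for binary cross-entropy with a scalar dummy scale.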
Problem

Research questions and friction points this paper is trying to address.

Evaluates invariant learning for robust mammogram classification.
Compares invariant learning methods against traditional risk minimization.
Assesses generalizability of breast cancer prediction models.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Utilizes invariant learning for mammogram classification.
Compares Invariant Risk Minimization and Variance Risk Extrapolation with Empirical Risk Minimization.
Evaluates robustness using multi-site mammogram datasets.
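A minimal sketch of how the three metrics named in the abstract (accuracy, average precision, and area under the curve) could be reported separately for each site is given below. The helper names, the 0.5 decision threshold, and the toy site data are assumptions for illustration; the average-precision helper also assumes untied scores, and each site needs at least one positive and one negative case.

```python
def accuracy(scores, labels, threshold=0.5):
    """Fraction of cases whose thresholded score matches the 0/1 label."""
    return sum((s >= threshold) == bool(y) for s, y in zip(scores, labels)) / len(labels)

def roc_auc(scores, labels):
    """AUC as the probability that a positive case outranks a negative one (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def average_precision(scores, labels):
    """Mean of the precision values measured at the rank of each positive case."""
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits

def per_site_report(site_predictions):
    """site_predictions: {site_name: (scores, labels)}, a hypothetical layout."""
    return {
        site: {
            "accuracy": accuracy(scores, labels),
            "average_precision": average_precision(scores, labels),
            "auc": roc_auc(scores, labels),
        }
        for site, (scores, labels) in site_predictions.items()
    }

# Toy predictions for two hypothetical sites.
report = per_site_report({
    "site_a": ([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]),
    "site_b": ([0.7, 0.2, 0.6, 0.4], [1, 0, 1, 0]),
})
for site, metrics in report.items():
    print(site, metrics)
```

Reporting per site rather than on pooled test data makes it visible when a model does well on the sites it was trained on but degrades on a held-out site.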
Authors

Hung Q. Vo
Department of Electrical and Computer Engineering, University of Houston, Houston, TX 77204 USA
Samira Zare
Department of Electrical and Computer Engineering, University of Houston, Houston, TX 77204 USA
Son T. Ly
Department of Electrical and Computer Engineering, University of Houston, Houston, TX 77204 USA
Lin Wang
Department of Systems Medicine and Biomedical Engineering, Houston Methodist Cancer Center, Houston, TX 77030 USA
Chika F. Ezeana
Department of Systems Medicine and Biomedical Engineering, Houston Methodist Cancer Center, Houston, TX 77030 USA
Xiaohui Yu
Professor, York University
data management, data mining, applied machine learning, urban computing, social media analysis
Kelvin K. Wong
Associate Research Professor of Radiology and Neurosurgery, Weill Cornell Medical College; Research
Machine Learning, Artificial Intelligence, Medical Imaging
Stephen T.C. Wong
Department of Systems Medicine and Biomedical Engineering, Houston Methodist Cancer Center, Houston, TX 77030 USA
Hien V. Nguyen
Department of Electrical and Computer Engineering, University of Houston, Houston, TX 77204 USA