🤖 AI Summary
This paper identifies the root cause of spurious feature reliance—e.g., bird-like textures—in medical image domain adaptation: a semantic gap between ImageNet pretraining and clinical tasks, inducing models to exploit superficial statistical correlations rather than pathologically meaningful semantics.
Method: We propose a diagnostic attribution framework integrating Grad-CAM, TCAV, and domain shift metrics to quantify interference from non-medical features; complemented by ablation-driven feature disentanglement experiments to systematically validate model dependence on non-clinical artifacts.
Contribution/Results: On CheXpert and MIMIC-CXR, up to 41% of predictions are artifact-driven. Our framework achieves 92% accuracy in identifying erroneous attributions—establishing both an interpretable theoretical foundation and actionable methodology for robust medical transfer learning.