🤖 AI Summary
Under empirical risk minimization (ERM), models often rely on spurious correlations, so-called "shortcuts", which leads to poor generalization on minority subgroups. To address this, we propose InterpoLated Learning (InterpoLL), a class-aware interpolation technique applied directly in the representation space: it blends the features of majority instances with those of intra-class minority instances, explicitly incorporating minority patterns and attenuating shortcut reliance. InterpoLL operates at the feature level, without modifying the model architecture or introducing additional hyperparameters, so the model learns representations that remain predictive across groups. Experiments on multiple natural language understanding benchmarks show that InterpoLL significantly improves accuracy on minority-group examples while preserving performance on majority-group examples. It consistently outperforms both standard ERM and state-of-the-art shortcut mitigation methods, and these gains hold across encoder, encoder-decoder, and decoder-only architectures.
📝 Abstract
Empirical risk minimization (ERM) incentivizes models to exploit shortcuts, i.e., spurious correlations between input attributes and labels that are prevalent in the majority of the training data but unrelated to the task at hand. This reliance hinders generalization on minority examples, where such correlations do not hold. Existing shortcut mitigation approaches are model-specific, difficult to tune, computationally expensive, and fail to improve learned representations. To address these issues, we propose InterpoLated Learning (InterpoLL), which interpolates the representations of majority examples to include features from intra-class minority examples with shortcut-mitigating patterns. This weakens shortcut influence, enabling models to acquire features predictive across both minority and majority examples. Experimental results on multiple natural language understanding tasks demonstrate that InterpoLL improves minority generalization over both ERM and state-of-the-art shortcut mitigation methods, without compromising accuracy on majority examples. Notably, these gains persist across encoder, encoder-decoder, and decoder-only architectures, demonstrating the method's broad applicability.
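The abstract does not include pseudocode, but a minimal sketch of the intra-class interpolation idea might look like the following (a PyTorch illustration; the function name `interpoll_sketch`, the uniform sampling of the interpolation weight, and the within-batch sampling of minority partners are assumptions for illustration, not the authors' exact procedure).

```python
import torch

def interpoll_sketch(feats, labels, is_majority, lam_max=0.5):
    """Blend each majority example's features with those of a randomly
    chosen minority example from the same class (illustrative sketch).

    feats:       (B, D) encoder representations
    labels:      (B,) class labels
    is_majority: (B,) bool, True for majority-group examples
    """
    mixed = feats.clone()
    for i in torch.where(is_majority)[0]:
        # candidate minority examples sharing the class of example i
        same_class = (labels == labels[i]) & (~is_majority)
        idx = torch.where(same_class)[0]
        if len(idx) == 0:
            continue  # no intra-class minority example in this batch
        j = idx[torch.randint(len(idx), (1,))].item()
        lam = torch.rand(1).item() * lam_max  # interpolation weight (assumed uniform)
        mixed[i] = (1 - lam) * feats[i] + lam * feats[j]
    return mixed  # labels are unchanged: interpolation is intra-class

# Usage: feed `mixed` (instead of the raw features) to the classification
# head and train with the usual cross-entropy loss; minority examples pass
# through unchanged.
```

Because the interpolation stays within a class, the label assigned to each mixed representation remains valid, while the injected minority features dilute whatever shortcut signal the majority example carried.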