A Variational Approach for Mitigating Entity Bias in Relation Extraction

📅 2025-06-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Relation extraction (RE) models suffer from entity bias, over-relying on superficial entity surface forms and exhibiting poor generalization. To address this, we propose the first application of the Variational Information Bottleneck (VIB) to RE, explicitly disentangling entity-specific information in the latent space while preserving relation-discriminative features. We further integrate adversarial entity-agnostic representation learning to enhance robustness against entity perturbations. Our approach is theoretically grounded, interpretable, and inherently adaptable across domains. Extensive experiments demonstrate state-of-the-art performance on three major RE benchmarks—general-domain, financial, and biomedical—achieving superior in-domain accuracy and exceptional cross-domain generalization under type-constrained substitutions (e.g., entity swapping and domain transfer), significantly outperforming existing methods.

Technology Category

Application Category

📝 Abstract
Mitigating entity bias is a critical challenge in Relation Extraction (RE), where models often rely excessively on entities, resulting in poor generalization. This paper presents a novel approach to address this issue by adapting a Variational Information Bottleneck (VIB) framework. Our method compresses entity-specific information while preserving task-relevant features. It achieves state-of-the-art performance on relation extraction datasets across general, financial, and biomedical domains, in both indomain (original test sets) and out-of-domain (modified test sets with type-constrained entity replacements) settings. Our approach offers a robust, interpretable, and theoretically grounded methodology.
Problem

Research questions and friction points this paper is trying to address.

Mitigating entity bias in Relation Extraction models
Compressing entity info while keeping task features
Improving generalization across multiple domain datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Variational Information Bottleneck framework
Compresses entity-specific information
Preserves task-relevant features
🔎 Similar Papers
No similar papers found.