DISCIE-Discriminative Closed Information Extraction

📅 2025-06-19

🏛️ International Workshop on the Semantic Web

📈 Citations: 0

✨ Influential: 0

career value

157K/year

🤖 AI Summary

To address the accuracy bottleneck in large-scale closed-domain information extraction—caused by millions of entities and hundreds of relations (especially long-tail ones)—this paper proposes a discriminative relation extraction framework. The method introduces type-aware embeddings that jointly encode entity types and relation-specific semantics, and designs an entity-relation joint scoring mechanism, enabling end-to-end discriminative modeling within a lightweight neural architecture. Compared to state-of-the-art generative approaches, our method achieves significantly higher F1 scores for both overall and long-tail relations while maintaining superior inference efficiency: average F1 improves by 4.2% and long-tail relation F1 by 9.7% on mainstream closed-domain benchmarks. Our core contribution is the first systematic integration of type-guided discriminative modeling into large-scale closed-domain extraction, achieving accuracy and generalization surpassing large language models with substantially fewer parameters.

Technology Category

Application Category

📝 Abstract

This paper introduces a novel method for closed information extraction. The method employs a discriminative approach that incorporates type and entity-specific information to improve relation extraction accuracy, particularly benefiting long-tail relations. Notably, this method demonstrates superior performance compared to state-of-the-art end-to-end generative models. This is especially evident for the problem of large-scale closed information extraction where we are confronted with millions of entities and hundreds of relations. Furthermore, we emphasize the efficiency aspect by leveraging smaller models. In particular, the integration of type-information proves instrumental in achieving performance levels on par with or surpassing those of a larger generative model. This advancement holds promise for more accurate and efficient information extraction techniques.

Problem

Research questions and friction points this paper is trying to address.

Improves relation extraction accuracy using discriminative approach

Addresses large-scale closed information extraction challenges

Enhances efficiency with smaller models and type-information

Innovation

Methods, ideas, or system contributions that make the work stand out.

Discriminative approach with type and entity info

Efficient smaller models outperform larger ones

Superior accuracy for long-tail relations

🔎 Similar Papers

No similar papers found.