Noise or Nuance: An Investigation Into Useful Information and Filtering For LLM Driven AKBC

📅 2025-09-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In constrained settings such as LM-KBC, triplet completion faces three key challenges: low-quality generation, difficulty in filtering spurious triplets, and non-robust parsing of LLM outputs. Method: This paper proposes a lightweight, RAG-free, and fine-tuning-free framework leveraging large language models (LLMs). It enhances information input to improve generation quality, exploits the LLM’s intrinsic discriminative capability for dynamic quality filtering, and introduces a configurable parsing strategy that balances flexibility and consistency according to task requirements. Contribution/Results: Experiments demonstrate significant improvements in completion accuracy and robustness. The study empirically delineates the applicability boundaries of different parsing strategies, offering a new paradigm for automated knowledge base construction (AKBC) that is efficient, interpretable, and deployment-friendly.

Technology Category

Application Category

📝 Abstract
RAG and fine-tuning are prevalent strategies for improving the quality of LLM outputs. However, in constrained situations, such as that of the 2025 LM-KBC challenge, such techniques are restricted. In this work we investigate three facets of the triple completion task: generation, quality assurance, and LLM response parsing. Our work finds that in this constrained setting: additional information improves generation quality, LLMs can be effective at filtering poor quality triples, and the tradeoff between flexibility and consistency with LLM response parsing is setting dependent.
Problem

Research questions and friction points this paper is trying to address.

Investigating useful information filtering for LLM-driven knowledge base completion
Evaluating generation quality assurance in constrained AKBC settings
Analyzing tradeoffs between flexibility and consistency in LLM response parsing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generation enhancement with additional information
LLM filtering for poor quality triples
Context-dependent LLM response parsing tradeoffs
🔎 Similar Papers
No similar papers found.
A
Alex Clay
City St George’s, University of London, Northampton Square, London, EC1V 0HB, United Kingdom
E
Ernesto Jiménez-Ruiz
City St George’s, University of London, Northampton Square, London, EC1V 0HB, United Kingdom
Pranava Madhyastha
Pranava Madhyastha
City, University of London; Imperial College London; Alan Turing Institute
Machine LearningNatural Language ProcessingComputer VisionSpeech Processing