🤖 AI Summary
Existing knowledge graph (KG) repair evaluation methods rely on dataset-specific benchmarks, suffering from poor reproducibility and limited generalizability. To address SHACL constraint violation repair, this paper introduces a systematic evaluation framework: it designs a violation-induction mechanism to generate diverse, controllable, and reproducible constraint violation scenarios, and conducts end-to-end repair experiments leveraging large language models (LLMs) with multi-strategy prompt engineering. The key innovation lies in the tight integration of SHACL semantics, graph-structural context, and prompt design. The authors demonstrate that prompts incorporating critical constraints and distilled contextual information significantly improve repair accuracy, achieving an average +23.6% gain over baselines. This work establishes a scalable, verifiable, and reproducible evaluation paradigm for KG repair, advancing both methodological rigor and practical applicability in constraint-aware KG maintenance.
📝 Abstract
We present a systematic approach for evaluating the quality of knowledge graph repairs with respect to constraint violations defined in the Shapes Constraint Language (SHACL). Current evaluation methods rely on *ad hoc* datasets, which limits the rigorous analysis of repair systems in more general settings. Our method addresses this gap by systematically generating violations using a novel mechanism, termed violation-inducing operations (VIOs). We use the proposed evaluation framework to assess a range of repair systems that we build using large language models. We analyze the performance of these systems across different prompting strategies. Results indicate that concise prompts containing both the relevant violated SHACL constraints and key contextual information from the knowledge graph yield the best performance.
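To make the idea of a violation-inducing operation (VIO) concrete, here is a minimal, hypothetical sketch, not the paper's implementation: a toy knowledge graph as a set of triples, a simplified `sh:minCount`-style check, and a VIO that deletes a required property to induce a single controlled, reproducible violation. All names and data (`ex:alice`, `min_count_violations`, `induce_min_count_violation`) are invented for illustration.

```python
# Hypothetical sketch of a violation-inducing operation (VIO); names and
# data are invented for illustration and are not the paper's implementation.

# A toy knowledge graph as a set of (subject, predicate, object) triples.
kg = {
    ("ex:alice", "rdf:type", "ex:Person"),
    ("ex:alice", "ex:name", "Alice"),
    ("ex:bob", "rdf:type", "ex:Person"),
    ("ex:bob", "ex:name", "Bob"),
}

def min_count_violations(graph, target_class, path, min_count=1):
    """Return subjects of `target_class` that have fewer than `min_count`
    values for `path` (a much-simplified sh:minCount check)."""
    subjects = {s for (s, p, o) in graph
                if p == "rdf:type" and o == target_class}
    return {s for s in subjects
            if sum(1 for (s2, p, _) in graph
                   if s2 == s and p == path) < min_count}

def induce_min_count_violation(graph, subject, path):
    """VIO: delete all `path` triples of `subject`, creating a known,
    reproducible minCount violation with a ground-truth repair."""
    return {t for t in graph if not (t[0] == subject and t[1] == path)}

# Before the VIO the toy graph conforms to the constraint...
assert min_count_violations(kg, "ex:Person", "ex:name") == set()
# ...and after it, exactly one controlled violation exists.
broken = induce_min_count_violation(kg, "ex:alice", "ex:name")
assert min_count_violations(broken, "ex:Person", "ex:name") == {"ex:alice"}
```

Because the deleted triples are known, such an operation yields both a controllable violation scenario and a ground truth against which a repair system's output can be scored.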