Conformal Path Reasoning: Trustworthy Knowledge Graph Question Answering via Path-Level Calibration

📅 2026-05-08

📈 Citations: 0

✨ Influential: 0

career value

122K/year

🤖 AI Summary

Existing knowledge graph question answering methods lack statistical coverage guarantees, and traditional conformal prediction approaches suffer from calibration failure and insufficient discriminability of path scores. To address these limitations, this work proposes a path-level conformal calibration mechanism coupled with a Residual Conformal Value Network (RCVNet) trained via PUCT-based exploration. The approach learns highly discriminative nonconformity scores while satisfying the exchangeability assumption, thereby enabling query-level conformal prediction. Empirical evaluation demonstrates that the method substantially improves both the compactness and coverage of predicted answer sets, achieving a 34% increase in empirical coverage and a 40% reduction in average prediction set size on benchmark datasets, significantly outperforming existing conformal baselines.

📝 Abstract

Knowledge Graph Question Answering (KGQA) has shown promise for grounded and interpretable reasoning, yet existing approaches often fail to provide reliable coverage guarantees over retrieved answers. While Conformal Prediction (CP) offers a principled framework for producing prediction sets with statistical guarantees, prior methods suffer from critical limitations in both calibration validity and score discriminability, resulting in violated coverage guarantees and excessively large prediction sets. To address these pitfalls, we propose Conformal Path Reasoning (CPR), a trustworthy KGQA framework with two key innovations. First, we perform query-level conformal calibration over path-level scores, preserving the exchangeability while generating path prediction sets. Second, we introduce the Residual Conformal Value Network (RCVNet), a lightweight module trained via PUCT-guided exploration to learn discriminative path-level nonconformity scores. Experiments on benchmarks show that CPR significantly improves the Empirical Coverage Rate by 34% while reducing average prediction set size by 40% compared to conformal baselines. These results validate the efficacy of CPR in satisfying coverage guarantees with substantially more compact answer sets.

Problem

Research questions and friction points this paper is trying to address.

Knowledge Graph Question Answering

Conformal Prediction

Coverage Guarantee

Prediction Set

Path Reasoning

Innovation

Methods, ideas, or system contributions that make the work stand out.

Conformal Prediction

Path-Level Calibration

Knowledge Graph Question Answering