AI Summary
Unreliable confidence estimation of large language model (LLM) responses hinders their trustworthy deployment in high-stakes applications. To address this, we propose Graph Self-Consistency (GSC), the first method that formalizes self-consistency as a multi-response consistency graph, where candidate responses serve as nodes. Leveraging graph neural networks (GNNs), GSC performs fine-grained, unsupervised probabilistic correctness assessment of each node, enabling label-free confidence calibration. Crucially, GSC transforms the inherently unstructured self-consistency paradigm into a learnable graph representation and realizes end-to-end confidence modeling via GNNs. The approach exhibits strong out-of-domain generalization: it improves calibration across multiple benchmarks, reducing Expected Calibration Error (ECE) by over 35% on average, while remaining robust on tasks from unseen domains.
Abstract
Reliable confidence estimation is essential for enhancing the trustworthiness of large language models (LLMs), especially in high-stakes scenarios. Despite its importance, accurately estimating confidence in LLM responses remains a significant challenge. In this work, we propose using an auxiliary learning model to assess response correctness based on the self-consistency of multiple outputs generated by the LLM. Our method builds a consistency graph to represent the agreement among multiple responses and uses a graph neural network (GNN) to estimate the likelihood that each response is correct. Experiments demonstrate that this method achieves strong calibration performance on various benchmark datasets and generalizes well to out-of-domain cases.
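To make the graph construction concrete, here is a minimal sketch of how a consistency graph over candidate responses might be built. The agreement measure (exact string match) and the degree-based confidence proxy standing in for the learned GNN are simplifying assumptions for illustration, not the paper's actual model; the function names are hypothetical.

```python
def consistency_graph(responses):
    # Nodes are candidate responses; an edge connects two responses
    # that agree. Exact string match is an assumed, simplified
    # agreement measure (the real method may use softer similarity).
    n = len(responses)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and responses[i] == responses[j]:
                adj[i][j] = 1
    return adj

def node_confidence(responses):
    # Degree-based proxy for the GNN's per-node correctness estimate:
    # a response that agrees with more of its peers receives a higher
    # confidence score (normalized to (0, 1] by counting the node itself).
    adj = consistency_graph(responses)
    n = len(responses)
    return [(sum(row) + 1) / n for row in adj]

# Example: four sampled answers, three of which agree.
scores = node_confidence(["42", "42", "17", "42"])
print(scores)  # the majority answer's nodes score higher than the outlier
```

In the actual method, this hand-crafted degree heuristic is replaced by a GNN that learns, without labels, to map each node's position in the consistency graph to a calibrated probability of correctness.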