🤖 AI Summary
Large language models (LLMs) frequently hallucinate because they struggle to recognise their own knowledge boundaries; existing instruction datasets emphasise answer generation over the explicit acknowledgement of "unknown" queries. To address this, the paper proposes Uncertainty-and-Sensitivity-Aware Tuning (US-Tuning), a two-stage instruction tuning approach for contextual question answering (QA). The first stage trains LLMs to recognise their knowledge boundaries and admit knowledge gaps, while the second stage reinforces instruction adherence through carefully designed causal prompts. Fine-tuned with US-Tuning, Llama2-7B achieves up to a 34.7% improvement in handling out-of-knowledge questions and outperforms GPT-4 by 4.2% in overall performance, while reducing incorrect answers in contextual QA and improving faithfulness to parametric knowledge in general QA tasks.
📝 Abstract
Large language models (LLMs) demonstrate remarkable capabilities but face challenges from hallucinations, which typically arise from insufficient knowledge or context. While instructing LLMs to acknowledge knowledge limitations by responding with "I don't know" appears promising, we find that models consistently struggle with admitting knowledge gaps. This challenge may originate from current instruction datasets that emphasise answer generation over knowledge boundary awareness. To address this limitation, we introduce Uncertainty-and-Sensitivity-Aware Tuning (US-Tuning), a novel two-stage approach for contextual question answering (QA). The first stage enhances LLMs' ability to recognise their knowledge boundaries, while the second stage reinforces instruction adherence through carefully designed causal prompts. Our experimental results demonstrate that US-Tuning not only significantly reduces incorrect answers in contextual QA but also improves models' faithfulness to their parametric knowledge, mitigating hallucinations in general QA tasks. Our fine-tuned Llama2-7B model achieves up to a 34.7% improvement in handling out-of-knowledge questions and outperforms GPT-4 by 4.2% in overall performance.
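The two-stage recipe described above can be sketched as training-data construction. This is an illustrative assumption, not the authors' released code: the exact prompt templates, refusal string, and causal-prompt wording are hypothetical stand-ins for the paper's actual design.

```python
# Hypothetical sketch of building examples for a two-stage tuning scheme
# like US-Tuning. Stage 1 pairs context-based questions with either the
# supported answer or an explicit refusal (knowledge boundary awareness);
# stage 2 wraps the same question in a causal prompt that asks the model
# to first judge whether the context entails an answer (instruction adherence).
from typing import Optional

REFUSAL = "I don't know."  # assumed refusal phrase

def stage1_example(context: str, question: str, answer: Optional[str]) -> dict:
    """Stage 1: if the context does not support an answer, the target
    response is an explicit refusal rather than a guess."""
    prompt = f"Context: {context}\nQuestion: {question}\nAnswer:"
    target = answer if answer is not None else REFUSAL
    return {"prompt": prompt, "response": target}

def stage2_example(context: str, question: str, answer: Optional[str]) -> dict:
    """Stage 2: a causal prompt makes the model state *why* it can or
    cannot answer before responding (wording is assumed)."""
    prompt = (
        f"Context: {context}\nQuestion: {question}\n"
        "First decide whether the context contains the answer, "
        "then respond accordingly.\nAnswer:"
    )
    if answer is None:
        target = f"The context does not contain the answer, so: {REFUSAL}"
    else:
        target = f"The context contains the answer: {answer}"
    return {"prompt": prompt, "response": target}

# One in-knowledge and one out-of-knowledge question from the same context.
ctx = "Paris is the capital of France."
known = stage1_example(ctx, "What is the capital of France?", "Paris")
unknown = stage1_example(ctx, "What is the capital of Spain?", None)
print(known["response"])    # Paris
print(unknown["response"])  # I don't know.
```

The point of the separation is that stage 1 shapes *what* the model answers (including refusals), while stage 2 shapes *how* it follows the instruction to check its knowledge boundary first.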