HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models

📅 2025-03-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) frequently generate hallucinations—factually incorrect or contextually inconsistent outputs. To address this, we propose *Induced Contrastive Decoding* (ICD), a novel paradigm that first identifies and selectively attenuates attention in critical heads based on predictive importance, thereby controllably inducing high-quality hallucinated samples; it then performs contrastive decoding between the induced hallucination and the original generation to enhance factual fidelity. ICD is the first method enabling head-level, interpretable, and controllable hallucination induction without external knowledge bases or model fine-tuning. Evaluated on context completion, reading comprehension, and question answering tasks, ICD significantly improves factual accuracy and contextual faithfulness, outperforming state-of-the-art hallucination mitigation and contrastive decoding approaches.

Technology Category

Application Category

📝 Abstract
Large Language Models (LLMs) often generate hallucinations, producing outputs that are contextually inaccurate or factually incorrect. We introduce HICD, a novel method designed to induce hallucinations for contrastive decoding to mitigate hallucinations. Unlike existing contrastive decoding methods, HICD selects attention heads crucial to the model's prediction as inducing heads, then induces hallucinations by dispersing attention of these inducing heads and compares the hallucinated outputs with the original outputs to obtain the final result. Our approach significantly improves performance on tasks requiring contextual faithfulness, such as context completion, reading comprehension, and question answering. It also improves factuality in tasks requiring accurate knowledge recall. We demonstrate that our inducing heads selection and attention dispersion method leads to more"contrast-effective"hallucinations for contrastive decoding, outperforming other hallucination-inducing methods. Our findings provide a promising strategy for reducing hallucinations by inducing hallucinations in a controlled manner, enhancing the performance of LLMs in a wide range of tasks.
Problem

Research questions and friction points this paper is trying to address.

Mitigates hallucinations in Large Language Models (LLMs).
Introduces HICD for contrastive decoding via attention dispersion.
Improves contextual faithfulness and factuality in LLM outputs.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Induces hallucinations via attention dispersion
Selects crucial attention heads for contrastive decoding
Improves contextual faithfulness and factuality
🔎 Similar Papers
No similar papers found.
X
Xinyan Jiang
Shanghai Advanced Research Institute, Chinese Academy of Sciences
H
Hang Ye
Shanghai Advanced Research Institute, Chinese Academy of Sciences
Yongxin Zhu
Yongxin Zhu
Professor, Chinese Academy of Sciences; Adjunct Professor, Shanghai Jiao Tong University;
zhuyongxin@sari.ac.cnzhuyongxin@sjtu.edu.cn
X
Xiaoying Zheng
Shanghai Advanced Research Institute, Chinese Academy of Sciences
Z
Zikang Chen
Shanghai Advanced Research Institute, Chinese Academy of Sciences
Jun Gong
Jun Gong
Shanghai Advanced Research Institute, Chinese Academy of Sciences