LANCET: Neural Intervention via Structural Entropy for Mitigating Faithfulness Hallucinations in LLMs

πŸ“… 2026-01-04
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of hallucination in large language models, which arises from the distributed nature of neural information propagation and remains inadequately mitigated by existing intervention methods. The authors introduce structural entropy into neural pathway analysis for the first time, leveraging a hallucination discrepancy ratio and gradient-driven contrast to precisely identify and map forward-propagation paths associated with hallucination-prone neurons. Building on this insight, they propose a hierarchical, surgery-like neural intervention mechanism that transcends the limitations of conventional node-level or coarse-grained suppression strategies. Evaluated across multiple hallucination benchmarks, the method significantly outperforms state-of-the-art approaches, effectively curbing hallucinations while preserving the model’s general capabilities.

Technology Category

Application Category

πŸ“ Abstract
Large Language Models have revolutionized information processing, yet their reliability is severely compromised by faithfulness hallucinations. While current approaches attempt to mitigate this issue through node-level adjustments or coarse suppression, they often overlook the distributed nature of neural information, leading to imprecise interventions. Recognizing that hallucinations propagate through specific forward transmission pathways like an infection, we aim to surgically block this flow using precise structural analysis. To leverage this, we propose Lancet, a novel framework that achieves precise neural intervention by leveraging structural entropy and hallucination difference ratios. Lancet first locates hallucination-prone neurons via gradient-driven contrastive analysis, then maps their propagation pathways by minimizing structural entropy, and finally implements a hierarchical intervention strategy that preserves general model capabilities. Comprehensive evaluations across hallucination benchmark datasets demonstrate that Lancet significantly outperforms state-of-the-art methods, validating the effectiveness of our surgical approach to neural intervention.
Problem

Research questions and friction points this paper is trying to address.

faithfulness hallucinations
neural intervention
structural entropy
Large Language Models
hallucination propagation
Innovation

Methods, ideas, or system contributions that make the work stand out.

structural entropy
neural intervention
faithfulness hallucinations
gradient-driven contrastive analysis
hierarchical intervention
Chenxu Wang
Chenxu Wang
University of Science and Technology of China
Chaozhuo Li
Chaozhuo Li
Microsoft Research Aisa
P
Pengbo Wang
Beijing University of Posts and Telecommunications, Beijing, China
Litian Zhang
Litian Zhang
Beihang University
S
Songyang Liu
Beijing University of Posts and Telecommunications, Beijing, China
J
Ji Qi
Beijing University of Posts and Telecommunications, Beijing, China
Jiahui Hu
Jiahui Hu
Postdoctoral researcher, Embry-Riddle Aeronautical University
Machine learningdata assimilationatmospheric scienceionosphere
Y
Yushan Cai
China National Petroleum Corporation, Beijing, China
Hao Zhao
Hao Zhao
China Europe International Business School
entrepreneurshipleadershiprecruitmentinnovationartificial intelligence
R
Rui Pu
Beijing University of Posts and Telecommunications, Beijing, China