Less is More for RAG: Information Gain Pruning for Generator-Aligned Reranking and Evidence Selection

📅 2026-01-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the weak or even negative correlation between traditional retrieval relevance metrics and answer quality under limited context budgets, which hinders effective evidence selection in retrieval-augmented generation (RAG) systems. To overcome this, the authors propose Information Gain Pruning (IGP), a novel approach that introduces generator-aligned utility signals to rerank and prune retrieved passages, eliminating redundant or harmful evidence. IGP requires no modification to existing context interfaces, ensuring straightforward deployment. Evaluated on five open-domain question answering benchmarks, IGP improves average F1 scores by 12%–20% while reducing input token consumption by 76%–79%, substantially enhancing the trade-off between generation quality and computational cost.

Technology Category

Application Category

📝 Abstract
Retrieval-augmented generation (RAG) grounds large language models with external evidence, but under a limited context budget, the key challenge is deciding which retrieved passages should be injected. We show that retrieval relevance metrics (e.g., NDCG) correlate weakly with end-to-end QA quality and can even become negatively correlated under multi-passage injection, where redundancy and mild conflicts destabilize generation. We propose \textbf{Information Gain Pruning (IGP)}, a deployment-friendly reranking-and-pruning module that selects evidence using a generator-aligned utility signal and filters weak or harmful passages before truncation, without changing existing budget interfaces. Across five open-domain QA benchmarks and multiple retrievers and generators, IGP consistently improves the quality--cost trade-off. In a representative multi-evidence setting, IGP delivers about +12--20% relative improvement in average F1 while reducing final-stage input tokens by roughly 76--79% compared to retriever-only baselines.
Problem

Research questions and friction points this paper is trying to address.

Retrieval-Augmented Generation
Evidence Selection
Context Budget
Redundancy
Generation Quality
Innovation

Methods, ideas, or system contributions that make the work stand out.

Information Gain Pruning
Retrieval-Augmented Generation
Generator-Aligned Reranking
Evidence Selection
Context Budget Optimization
🔎 Similar Papers
No similar papers found.
Z
Zhipeng Song
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
Yizhi Zhou
Yizhi Zhou
George mason university
RoboticsSLAMState Estimation
Xiangyu Kong
Xiangyu Kong
Beijing Information Science & Technology University
Audio/Speech ProcessingReinforcement LearningComputer Vision
J
Jiulong Jiao
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
X
Xinrui Bao
School of Electronic and Information Engineering, Liaoning Technical University, 188 Longwan South Street, Sijiatun District, Huludao, 125105, China
X
Xu You
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
X
Xueqing Shi
College of Health-Preservation and Wellness, Dalian Medical University, No. 9 West Section of Lvshun South Road, Lvshunkou District, Dalian, 116044, China
Y
Yuhang Zhou
Tencent (Dalian Northern Interactive Entertainment Technology Co., Ltd.), 21/F, Tencent Building, No. 26 Jingxian St, Ganjingzi District, Dalian, 116085, China
H
Heng Qi
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China