Less is More for RAG: Information Gain Pruning for Generator-Aligned Reranking and Evidence Selection

📅 2026-01-24

📈 Citations: 0

✨ Influential: 0

career value

187K/year

🤖 AI Summary

This work addresses the weak or even negative correlation between traditional retrieval relevance metrics and answer quality under limited context budgets, which hinders effective evidence selection in retrieval-augmented generation (RAG) systems. To overcome this, the authors propose Information Gain Pruning (IGP), a novel approach that introduces generator-aligned utility signals to rerank and prune retrieved passages, eliminating redundant or harmful evidence. IGP requires no modification to existing context interfaces, ensuring straightforward deployment. Evaluated on five open-domain question answering benchmarks, IGP improves average F1 scores by 12%–20% while reducing input token consumption by 76%–79%, substantially enhancing the trade-off between generation quality and computational cost.

Technology Category

Application Category

📝 Abstract

Retrieval-augmented generation (RAG) grounds large language models with external evidence, but under a limited context budget, the key challenge is deciding which retrieved passages should be injected. We show that retrieval relevance metrics (e.g., NDCG) correlate weakly with end-to-end QA quality and can even become negatively correlated under multi-passage injection, where redundancy and mild conflicts destabilize generation. We propose \textbf{Information Gain Pruning (IGP)}, a deployment-friendly reranking-and-pruning module that selects evidence using a generator-aligned utility signal and filters weak or harmful passages before truncation, without changing existing budget interfaces. Across five open-domain QA benchmarks and multiple retrievers and generators, IGP consistently improves the quality--cost trade-off. In a representative multi-evidence setting, IGP delivers about +12--20% relative improvement in average F1 while reducing final-stage input tokens by roughly 76--79% compared to retriever-only baselines.

Problem

Research questions and friction points this paper is trying to address.

Retrieval-Augmented Generation

Evidence Selection

Context Budget

Redundancy

Generation Quality

Innovation

Methods, ideas, or system contributions that make the work stand out.

Information Gain Pruning

Retrieval-Augmented Generation

Generator-Aligned Reranking