CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation

📅 2026-05-06

Abstract
Retrieval-Augmented Generation (RAG) depends on document ranking to provide useful evidence for generation, but conventional reranking methods mainly optimize query-document relevance rather than generation usefulness. A relevant document may still introduce noise, while a lower-ranked document may better reduce the generator's uncertainty. We propose CAR (Confidence-Aware Reranking), a query-guided, training-free, and plug-and-play reranking framework that uses generator confidence change as a document usefulness signal. CAR estimates confidence through the semantic consistency of multiple sampled answers under query-only and query-document conditions. Documents that significantly increase confidence are promoted, those that decrease confidence are demoted, and uncertain cases preserve the baseline order, while a query-level gate avoids unnecessary intervention on already confident queries. Experiments on four BEIR datasets show that CAR consistently improves NDCG@5 across sparse and dense retrievers, LLM-based and supervised rerankers, and four LLM backbones. Notably, CAR improves the YesNo reranker by 25.4% on average under Contriever retrieval, and its ranking gains strongly correlate with downstream generation F1 improvements, achieving Spearman ρ = 0.964.
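The promote / demote / preserve logic described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the confidence estimator, the thresholds, or how promoted and demoted documents are merged with the baseline order. The names `consistency`, `car_rerank`, `sample_answers`, `tau`, and `gate` are hypothetical, grouping answers by normalized string is only a stand-in for a real semantic-consistency measure, and the three-bucket stable sort is one assumed way to realize the reordering.

```python
from collections import Counter
from typing import Callable, List, Optional, Sequence


def consistency(answers: Sequence[str]) -> float:
    """Fraction of sampled answers that fall in the largest agreement group.

    Assumption: answers are grouped by lowercased, whitespace-normalized text,
    a crude stand-in for semantic equivalence.
    """
    if not answers:
        return 0.0
    groups = Counter(" ".join(a.lower().split()) for a in answers)
    return max(groups.values()) / len(answers)


def car_rerank(
    query: str,
    docs: Sequence[str],
    sample_answers: Callable[[str, Optional[str]], List[str]],
    tau: float = 0.2,   # hypothetical threshold for a "significant" change
    gate: float = 0.9,  # hypothetical query-level confidence gate
) -> List[str]:
    """Reorder `docs` (given in baseline rank order) by confidence change."""
    # Confidence under the query-only condition.
    base_conf = consistency(sample_answers(query, None))

    # Query-level gate: leave already confident queries untouched.
    if base_conf >= gate:
        return list(docs)

    def bucket(doc: str) -> int:
        # Confidence change when the document is added to the prompt.
        delta = consistency(sample_answers(query, doc)) - base_conf
        if delta > tau:
            return 0  # promote
        if delta < -tau:
            return 2  # demote
        return 1      # uncertain: keep relative baseline position

    # sorted() is stable, so baseline order is preserved within each bucket.
    return sorted(docs, key=bucket)


def toy_sampler(query: str, doc: Optional[str]) -> List[str]:
    """Deterministic fake generator used only to exercise the sketch."""
    if doc is None:
        return ["Paris", "Lyon", "paris", "Nice"]
    if doc == "good":
        return ["Paris", "paris ", "Paris", "Paris"]
    if doc == "noisy":
        return ["Lyon", "Nice", "Rome", "Oslo"]
    return ["Paris", "Lyon", "paris", "Rome"]


ranked = car_rerank("capital of France?", ["noisy", "meh", "good", "ok"], toy_sampler)
print(ranked)  # ['good', 'meh', 'ok', 'noisy']
```

In practice `sample_answers` would call the generator several times at non-zero temperature, and `consistency` would compare answers semantically rather than by surface form. Each call to `car_rerank` costs one sampling round for the query plus one per candidate document, which is why a query-level gate that skips confident queries is useful.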
Problem

Research questions and friction points this paper is trying to address.

Retrieval-Augmented Generation
reranking
generation usefulness
document relevance
generator confidence
Innovation

Methods, ideas, or system contributions that make the work stand out.

Confidence-Aware Reranking
Retrieval-Augmented Generation
Generator Confidence
Semantic Consistency
Plug-and-Play Reranking
Zhipeng Song
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
Yizhi Zhou
George Mason University
Robotics, SLAM, State Estimation
Xiangyu Kong
Beijing Information Science & Technology University
Audio/Speech Processing, Reinforcement Learning, Computer Vision
Jiulong Jiao
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
Xuezhou Ye
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
Chunqi Gao
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China
Xueqing Shi
College of Health-Preservation and Wellness, Dalian Medical University, No. 9 West Section of Lvshun South Road, Lvshunkou District, Dalian, 116044, China
Yuhang Zhou
Tencent (Dalian Northern Interactive Entertainment Technology Co., Ltd.), 21/F, Tencent Building, No. 26 Jingxian St, Ganjingzi District, Dalian, 116085, China
Heng Qi
School of Computer Science and Technology, Dalian University of Technology, No.2 Linggong Road, Ganjingzi District, Dalian, 116024, China