R$^3$AG: Retriever Routing for Retrieval-Augmented Generation

📅 2026-04-22
📈 Citations: 0
Influential: 0
📄 PDF

career value

182K/year
🤖 AI Summary
This work addresses the limitations of traditional retrieval-augmented generation (RAG) systems, which rely on static retrieval strategies and fail to dynamically select the optimal retriever for each query. Existing routing approaches further suffer from considering only semantic relevance while neglecting the actual utility of retrieved results in supporting answer correctness. To overcome these issues, the authors propose the R³AG framework, which decouples retriever capability into two learnable dimensions—“retrieval quality” and “generation utility”—and employs contrastive learning with dual supervision signals at both document and answer levels to model the dynamic alignment between queries and retriever capabilities. Experiments across multiple knowledge-intensive tasks demonstrate that R³AG significantly outperforms the strongest single retriever and current static routing methods, validating the effectiveness and superiority of its dynamic routing mechanism.

Technology Category

Application Category

📝 Abstract
Retrieval-augmented generation (RAG) has become a cornerstone for knowledge-intensive tasks. However, the efficacy of RAG is often bottlenecked by the ``one-size-fits-all'' retrieval paradigm, as different queries exhibit distinct preferences for different retrievers. While recent routing techniques attempt to select the optimal retriever dynamically, they typically operate under a ``single and static capability'' assumption, selecting retrievers solely based on semantic relevance. This overlooks a critical distinction in RAG: a retrieved document must not only be relevant but also effectively support the generator in producing correct answers. To address this limitation, we propose R$^3$AG, a novel routing framework that explicitly models the dynamic alignment between queries and retriever capabilities. Unlike previous approaches, R$^3$AG decomposes retriever capability into two learnable dimensions: retrieval quality and generation utility. We employ a contrastive learning objective that leverages complementary supervision signals, \textit{i.e.}, document assessments and downstream answer correctness, to capture query-specific preference shifts. Extensive experiments on several knowledge-intensive tasks show that R$^3$AG consistently outperforms both the best individual retrievers and state-of-the-art static routing methods.
Problem

Research questions and friction points this paper is trying to address.

Retrieval-Augmented Generation
Retriever Routing
Generation Utility
Query-Specific Preference
Knowledge-Intensive Tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Retrieval-Augmented Generation
Retriever Routing
Dynamic Alignment
Generation Utility
Contrastive Learning
🔎 Similar Papers
No similar papers found.