🤖 AI Summary
This work addresses the limitations of traditional retrieval-augmented generation (RAG) systems, which rely on static retrieval strategies and fail to dynamically select the optimal retriever for each query. Existing routing approaches further suffer from considering only semantic relevance while neglecting the actual utility of retrieved results in supporting answer correctness. To overcome these issues, the authors propose the R³AG framework, which decouples retriever capability into two learnable dimensions—“retrieval quality” and “generation utility”—and employs contrastive learning with dual supervision signals at both document and answer levels to model the dynamic alignment between queries and retriever capabilities. Experiments across multiple knowledge-intensive tasks demonstrate that R³AG significantly outperforms the strongest single retriever and current static routing methods, validating the effectiveness and superiority of its dynamic routing mechanism.
📝 Abstract
Retrieval-augmented generation (RAG) has become a cornerstone for knowledge-intensive tasks. However, the efficacy of RAG is often bottlenecked by the ``one-size-fits-all'' retrieval paradigm, as different queries exhibit distinct preferences for different retrievers. While recent routing techniques attempt to select the optimal retriever dynamically, they typically operate under a ``single and static capability'' assumption, selecting retrievers solely based on semantic relevance. This overlooks a critical distinction in RAG: a retrieved document must not only be relevant but also effectively support the generator in producing correct answers. To address this limitation, we propose R$^3$AG, a novel routing framework that explicitly models the dynamic alignment between queries and retriever capabilities. Unlike previous approaches, R$^3$AG decomposes retriever capability into two learnable dimensions: retrieval quality and generation utility. We employ a contrastive learning objective that leverages complementary supervision signals, \textit{i.e.}, document assessments and downstream answer correctness, to capture query-specific preference shifts. Extensive experiments on several knowledge-intensive tasks show that R$^3$AG consistently outperforms both the best individual retrievers and state-of-the-art static routing methods.