Your Dense Retriever is Secretly an Expeditious Reasoner

📅 2025-09-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Dense retrievers underperform on complex, reasoning-intensive queries, while exhaustive large language model (LLM) invocation incurs prohibitive computational overhead. To address this trade-off, we propose AdaQR, an adaptive query reasoning framework that implicitly embeds LLM-based reasoning capabilities into the vector space. AdaQR introduces a lightweight routing module that dynamically selects between efficient dense retrieval and deep LLM-based query rewriting—enabling on-demand, in-embedding-space reasoning enhancement without universal LLM invocation. This design eliminates unnecessary LLM calls while preserving semantic fidelity and reasoning depth. Evaluated on the large-scale BRIGHT benchmark, AdaQR achieves a 7% improvement in retrieval effectiveness while reducing inference cost by 28%, demonstrating a significant advance in balancing accuracy and efficiency for open-domain retrieval.

Technology Category

Application Category

📝 Abstract
Dense retrievers enhance retrieval by encoding queries and documents into continuous vectors, but they often struggle with reasoning-intensive queries. Although Large Language Models (LLMs) can reformulate queries to capture complex reasoning, applying them universally incurs significant computational cost. In this work, we propose Adaptive Query Reasoning (AdaQR), a hybrid query rewriting framework. Within this framework, a Reasoner Router dynamically directs each query to either fast dense reasoning or deep LLM reasoning. The dense reasoning is achieved by the Dense Reasoner, which performs LLM-style reasoning directly in the embedding space, enabling a controllable trade-off between efficiency and accuracy. Experiments on large-scale retrieval benchmarks BRIGHT show that AdaQR reduces reasoning cost by 28% while preserving-or even improving-retrieval performance by 7%.
Problem

Research questions and friction points this paper is trying to address.

Dense retrievers struggle with complex reasoning-intensive queries
Universal LLM query reformulation incurs high computational costs
Framework enables efficient trade-off between reasoning accuracy and cost
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid query rewriting framework AdaQR
Dynamic routing between dense and LLM reasoning
Dense Reasoner performs reasoning in embedding space
🔎 Similar Papers
No similar papers found.
Y
Yichi Zhang
School of Computer Science and Engineering, Beihang University, China
Jun Bai
Jun Bai
Assistant professor
Computer aided drug discoveryMedical image analysisAI therapeutic target identification
Z
Zhixin Cai
School of Computer Science and Engineering, Beihang University, China
S
Shuhan Qin
School of Computer Science and Engineering, Beihang University, China
Z
Zhuofan Chen
School of Computer Science and Engineering, Beihang University, China
J
Jinghua Guan
School of Computer Science and Engineering, Beihang University, China
Wenge Rong
Wenge Rong
Beihang University
Natural Language ProcessingMachine LearningInformation Systems