AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning

📅 2026-04-19
📈 Citations: 0
Influential: 0
📄 PDF

career value

202K/year
🤖 AI Summary
This work addresses the high latency in multi-step retrieval-augmented generation (Agentic RAG) systems caused by redundant search queries and the inherent trade-off between efficiency and sufficiency under fixed search depths. To overcome these limitations, the authors propose AutoSearch, a novel framework that, for the first time, integrates reinforcement learning with a self-answering mechanism to dynamically assess the necessity of each retrieval step and adaptively determine the minimal sufficient search depth. By employing reward modeling to stabilize search behavior, AutoSearch significantly outperforms existing methods across multiple benchmarks, reducing redundant retrievals while maintaining or even improving answer accuracy, thereby achieving a superior balance between efficiency and correctness.

Technology Category

Application Category

📝 Abstract
Agentic retrieval-augmented generation (RAG) systems enable large language models (LLMs) to solve complex tasks through multi-step interaction with external retrieval tools. However, such multi-step interaction often involves redundant search steps, incurring substantial computational cost and latency. Prior work limits search depth (i.e., the number of search steps) to reduce cost, but this often leads to underexploration of complex questions. To address this, we first investigate how search depth affects accuracy and find a minimal sufficient search depth that defines an accuracy-efficiency trade-off, jointly determined by question complexity and the agent's capability. Furthermore, we propose AutoSearch, a reinforcement learning (RL) framework that evaluates each search step via self-generated intermediate answers. By a self-answering mechanism, AutoSearch identifies the minimal sufficient search depth and promotes efficient search by rewarding its attainment while penalizing over-searching. In addition, reward mechanisms are introduced to stabilize search behavior and improve answer quality on complex questions. Extensive experiments on multiple benchmarks show that AutoSearch achieves a superior accuracy-efficiency trade-off, alleviating over-searching while preserving search quality.
Problem

Research questions and friction points this paper is trying to address.

Agentic RAG
search depth
redundant search
accuracy-efficiency trade-off
complex questions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adaptive Search Depth
Agentic RAG
Reinforcement Learning
Self-answering Mechanism
Accuracy-Efficiency Trade-off
🔎 Similar Papers
No similar papers found.
J
Jingbo Sun
SKL-MAIS, Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences
W
Wenyue Chong
SKL-MAIS, Institute of Automation, Chinese Academy of Sciences; School of Advanced Interdisciplinary Sciences, University of Chinese Academy of Sciences
Songjun Tu
Songjun Tu
Institute of Automation, Chinese Academy of Sciences; Pengcheng Laboratory
Large Language ModelsReinforecement Learning
Qichao Zhang
Qichao Zhang
中国科学院自动化研究所
人工智能 强化学习 博弈论 自适应动态规划
Y
Yaocheng Zhang
SKL-MAIS, Institute of Automation, Chinese Academy of Sciences; School of Advanced Interdisciplinary Sciences, University of Chinese Academy of Sciences
Jiajun Chai
Jiajun Chai
Meituan Inc.
Reinforcement LearningLLMsAgentic Learning
Xiaohan Wang
Xiaohan Wang
Meituan
Deep reinforcement learningOptimizationCausal inferenceScheduling
W
Wei Lin
Meituan
Guojun Yin
Guojun Yin
Meituan, University of Science and Technology of China
MultimodalityComputer VisionFoundation ModelsDeep LearningImage/Video Processing
Dongbin Zhao
Dongbin Zhao
Institute of Automation, Chinese Academy of Sciences
Deep Reinforcement LearningAdaptive Dynamic ProgrammingGame AISmart drivingrobotics