🤖 AI Summary
In conversational search, users’ ambiguous queries pose significant challenges for precise reformulation into standalone search queries. To address this, we propose a test-time adaptive, prompt-driven query rewriting framework that operates via black-box API calls—requiring no access to large language model (LLM) parameters or fine-tuning. Our method dynamically optimizes rewriting outputs during inference. Key contributions include: (1) a lightweight reward model trained with contrastive ranking loss; and (2) an end-to-end, tuning-free test-time adaptation strategy that jointly leverages result-supervised reward modeling and Best-of-N re-ranking to select the optimal rewrite. Extensive experiments across five conversational search benchmarks demonstrate substantial improvements over state-of-the-art methods. Moreover, our approach exhibits strong robustness and generalization—achieving consistent performance on unseen domains and under diverse mainstream commercial LLM APIs.
📝 Abstract
Prompting-based conversational query reformulation has emerged as a powerful approach for conversational search, refining ambiguous user queries into standalone search queries. Best-of-N reformulation over the generated candidates via prompting shows impressive potential scaling capability. However, both the previous tuning methods (training time) and adaptation approaches (test time) can not fully unleash their benefits. In this paper, we propose AdaRewriter, a novel framework for query reformulation using an outcome-supervised reward model via test-time adaptation. By training a lightweight reward model with contrastive ranking loss, AdaRewriter selects the most promising reformulation during inference. Notably, it can operate effectively in black-box systems, including commercial LLM APIs. Experiments on five conversational search datasets show that AdaRewriter significantly outperforms the existing methods across most settings, demonstrating the potential of test-time adaptation for conversational query reformulation.