QUESTER: Query Specification for Generative Retrieval

📅 2025-11-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Generative retrieval (GR) shows promise but suffers from poor generalization and high scalability costs. This paper proposes QUESTER, the first framework to reformulate GR as a query semantic generation task: it employs a lightweight language model to generate BM25-compatible keyword queries and introduces GRPO, a novel reinforcement learning algorithm, for end-to-end joint optimization. By generating retrievable keywords instead of document IDs, QUESTER circumvents the generalization bottleneck inherent in direct ID generation, thereby enhancing cross-domain robustness and computational efficiency. Experiments demonstrate that QUESTER outperforms BM25 on both in-domain and zero-shot cross-domain retrieval tasks, matching the effectiveness of state-of-the-art neural retrievers while offering low latency and straightforward deployment. Its core innovation lies in bridging generative paradigms with traditional retrieval infrastructure through semantically grounded, BM25-compatible query generation.

Technology Category

Application Category

📝 Abstract
Generative Retrieval (GR) differs from the traditional index-then-retrieve pipeline by storing relevance in model parameters and directly generating document identifiers. However, GR often struggles to generalize and is costly to scale. We introduce QUESTER (QUEry SpecificaTion gEnerative Retrieval), which reframes GR as query specification generation - in this work, a simple keyword query handled by BM25 - using a (small) LLM. The policy is trained using reinforcement learning techniques (GRPO). Across in- and out-of-domain evaluations, we show that our model is more effective than BM25, and competitive with neural IR models, while maintaining a good efficiency
Problem

Research questions and friction points this paper is trying to address.

Generative Retrieval struggles with generalization and scaling costs
It reframes retrieval as query specification using small LLMs
Trains policy with reinforcement learning for competitive neural IR performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative retrieval using reinforcement learning training
Query specification generation with small LLM
Competitive effectiveness with neural IR models
🔎 Similar Papers
No similar papers found.