🤖 AI Summary
Ambiguous user queries degrade the quality of long-form question answering (LFQA) by large language models (LLMs), particularly when interacting with black-box LLMs.
Method: This paper proposes an instance-level, single-turn query rewriter to enhance query interpretability for black-box LLMs. It introduces a novel direct preference optimization (DPO) training paradigm guided by an automated answer quality discriminator—eliminating the need for human annotations—and incorporates a lightweight, plug-and-play prompt optimization mechanism.
Contribution/Results: Extensive experiments across multiple black-box LLMs and LFQA benchmarks demonstrate that the rewriter significantly improves answer quality, exhibits strong generalization across models and domains, and maintains practical deployability. The approach is computationally efficient and does not require model fine-tuning or API access beyond standard inference. All code is publicly released.
📝 Abstract
Large Language Models (LLMs) have demonstrated significant capabilities, particularly in the domain of question answering (QA). However, their effectiveness in QA is often undermined by the vagueness of user questions. To address this issue, we introduce single-round instance-level prompt optimization, referred to as question rewriter. By enhancing the intelligibility of human questions for black-box LLMs, our question rewriter improves the quality of generated answers. The rewriter is optimized using direct preference optimization based on feedback collected from automatic criteria for evaluating generated answers; therefore, its training does not require costly human annotations. The experiments across multiple black-box LLMs and long-form question answering (LFQA) datasets demonstrate the efficacy of our method. This paper provides a practical framework for training question rewriters and sets a precedent for future explorations in prompt optimization within LFQA tasks. Code is available at https://github.com/3244we/Question-Rewriter.