Multi-Turn Interactions for Text-to-SQL with Large Language Models

šŸ“… 2024-08-09
šŸ›ļø Proceedings of the 34th ACM International Conference on Information and Knowledge Management
šŸ“ˆ Citations: 1
✨ Influential: 0
šŸ“„ PDF
šŸ¤– AI Summary
To address the low parsing efficiency, opaque interaction, and poor generalization of large language models (LLMs) in Text-to-SQL for wide-table scenarios, this paper proposes Interactive-T2S—a framework enabling iterative human-AI collaboration wherein the LLM directly interacts with the database to generate SQL progressively and transparently. Its core contributions are: (1) four generic, schema-agnostic database interaction tools that support cross-schema generalization; and (2) a structured, example-driven stepwise reasoning paradigm integrating dynamic context construction and chain-of-thought prompting. Evaluated on Spider and BIRD (including its variants), Interactive-T2S achieves new state-of-the-art performance on the BIRD leaderboard under the non-oracle setting, significantly improving both query accuracy on wide-table schemas and the interpretability of human-system interaction.

Technology Category

Application Category

šŸ“ Abstract
This study explores text-to-SQL parsing by leveraging the powerful reasoning capabilities of large language models (LLMs). Despite recent advancements, existing LLM-based methods are still inefficient and struggle to handle cases with wide tables effectively. Furthermore, current interaction-based approaches either lack a step-by-step, interpretable SQL generation process or fail to provide a universally applicable interaction design. To address these challenges, we introduce Interactive-T2S, a framework that generates SQL queries through direct interactions with databases. This framework includes four general tools that facilitate proactive and efficient information retrieval by the LLM. Additionally, we have developed detailed exemplars to demonstrate the step-wise reasoning processes within our framework. Our approach achieves advanced performance on the Spider and BIRD datasets as well as their variants. Notably, we obtain state-of-the-art results on the BIRD leaderboard under the setting without oracle knowledge, demonstrating the effectiveness of our method.
Problem

Research questions and friction points this paper is trying to address.

Addresses inefficient SQL generation with wide tables
Provides step-by-step interpretable SQL query generation
Creates universally applicable database interaction framework
Innovation

Methods, ideas, or system contributions that make the work stand out.

Interactive-T2S framework for direct database interactions
Four general tools enabling proactive information retrieval
Step-wise reasoning exemplars for interpretable SQL generation
šŸ”Ž Similar Papers
No similar papers found.
G
Guanming Xiong
Peking University, Beijing, China
Junwei Bao
Junwei Bao
zuoyebang.com // JD.com // MSRA
NLPLLMQA+DialogGeneration
H
Hongfei Jiang
Zuoyebang Education Technology Co., Ltd., Beijing, China
Y
Yang Song
Zuoyebang Education Technology Co., Ltd., Beijing, China
Wen Zhao
Wen Zhao
JSPS International Fellow, UT-Austin Postdoc, KAUST
MEMSSensorNonlinear Dynamics