🤖 AI Summary
This work addresses the common challenge faced by data analysts who begin with vague questions and iteratively refine their information needs through exploration. To support this process, the authors propose Pneuma-Seeker, a novel system that leverages large language models (LLMs) not as opaque question-answering engines but as transparent, interactive analytical collaborators. Pneuma-Seeker enables users to explicitly articulate their information requirements as verifiable relational specifications and supports iterative refinement of these specifications, targeted data discovery, and execution with full provenance tracking. By integrating LLMs with relational specification modeling, data provenance, and an interactive interface, the system demonstrates its effectiveness in two real-world public procurement use cases, where it successfully helped users dynamically concretize evolving analytical needs and accurately retrieve relevant data.
📝 Abstract
Data analysts working with relational data often start with vague or underspecified questions and refine them iteratively as they explore the data. To support this iterative process, we demonstrate Pneuma-Seeker, a system that reifies a user's information need as explicit, inspectable relational specifications, enabling iterative refinement of the information need, targeted data discovery, and provenance-aware execution. Through two real-world procurement use cases, we show how Pneuma-Seeker leverages LLMs as transparent, interactive analytical collaborators rather than opaque answer engines.