Aggregation Queries over Unstructured Text: Benchmark and Agentic Method

📅 2026-02-01

📈 Citations: 0

✨ Influential: 0

career value

173K/year

🤖 AI Summary

This work addresses the challenge of ensuring completeness—i.e., retrieving all relevant results rather than just one—in aggregating queries over unstructured text. Focusing on large-scale corpora, the paper formally defines the task of entity-level aggregation queries and introduces AGGBench, the first benchmark specifically designed to evaluate completeness in such settings. To tackle this problem, the authors propose DFA, a modular agent framework that decomposes query processing into three interpretable stages: disambiguation, filtering, and aggregation. Experimental results demonstrate that DFA significantly outperforms strong retrieval-augmented generation (RAG) baselines and existing agent-based approaches in terms of evidence coverage, thereby substantially improving the completeness of aggregation query results.

Technology Category

Application Category

📝 Abstract

Aggregation query over free text is a long-standing yet underexplored problem. Unlike ordinary question answering, aggregate queries require exhaustive evidence collection and systems are required to"find all,"not merely"find one."Existing paradigms such as Text-to-SQL and Retrieval-Augmented Generation fail to achieve this completeness. In this work, we formalize entity-level aggregation querying over text in a corpus-bounded setting with strict completeness requirement. To enable principled evaluation, we introduce AGGBench, a benchmark designed to evaluate completeness-oriented aggregation under realistic large-scale corpus. To accompany the benchmark, we propose DFA (Disambiguation--Filtering--Aggregation), a modular agentic baseline that decomposes aggregation querying into interpretable stages and exposes key failure modes related to ambiguity, filtering, and aggregation. Empirical results show that DFA consistently improves aggregation evidence coverage over strong RAG and agentic baselines. The data and code are available in \href{https://anonymous.4open.science/r/DFA-A4C1}.

Problem

Research questions and friction points this paper is trying to address.

aggregation query

unstructured text

completeness

entity-level aggregation

evidence collection

Innovation

Methods, ideas, or system contributions that make the work stand out.

aggregation query

completeness

AGGBench