Structuring the Unstructured: A Multi-Agent System for Extracting and Querying Financial KPIs and Guidance

📅 2025-05-25
🤖 AI Summary
Problem: Low efficiency and poor generalizability in parsing unstructured financial earnings reports hinder automated financial analysis. Method: The paper proposes a dual-agent large language model framework tailored for finance: an Extraction Agent that automatically identifies, standardizes, and validates KPIs, and a Text-to-SQL Agent that lets users run ad-hoc natural language queries without knowing the database schema. The architecture decouples structured information extraction from semantic querying and integrates prompt engineering, structured pipelines, and human-in-the-loop verification. Contribution/Results: Evaluated on real-world financial reports, the approach achieves approximately 95% KPI extraction accuracy, on par with human annotators, and 91% of its natural-language query responses were rated correct by human evaluators, with consistent performance across financial document types. The system advances end-to-end automation, scalability, and practicality of financial report structuring and analysis relative to labor-intensive manual processes.

📝 Abstract
Extracting structured and quantitative insights from unstructured financial filings is essential in investment research, yet remains time-consuming and resource-intensive. Conventional approaches in practice rely heavily on labor-intensive manual processes, limiting scalability and delaying the research workflow. In this paper, we propose an efficient and scalable method for accurately extracting quantitative insights from unstructured financial documents, leveraging a multi-agent system composed of large language models. Our proposed multi-agent system consists of two specialized agents: the Extraction Agent and the Text-to-SQL Agent. The Extraction Agent automatically identifies key performance indicators from unstructured financial text, standardizes their formats, and verifies their accuracy. The Text-to-SQL Agent, in turn, generates executable SQL statements from natural language queries, allowing users to access structured data accurately without requiring familiarity with the database schema. Through experiments, we demonstrate that our proposed system accurately transforms unstructured text into structured data and enables precise retrieval of key information. First, we demonstrate that our system achieves approximately 95% accuracy in transforming financial filings into structured data, matching the performance level typically attained by human annotators. Second, in a human evaluation of the retrieval task (where natural language queries are used to search information from structured data), 91% of the responses were rated as correct by human evaluators. In both evaluations, our system generalizes well across financial document types, consistently delivering reliable performance.
Problem

Research questions and friction points this paper is trying to address.

Extracting structured financial insights from unstructured documents
Automating labor-intensive manual KPI extraction processes
Enabling accurate natural language queries for financial data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent system for financial KPI extraction
Extraction Agent standardizes and verifies KPIs
Text-to-SQL Agent generates SQL from queries
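The two roles listed above can be illustrated with a minimal sketch. It is not the authors' implementation: the LLM calls are left out entirely, and the `KPI` record, the `kpi` table layout, the helper names, and the example sentence about a made-up company are all assumptions chosen for illustration. The sketch only shows the deterministic steps around the agents: standardizing an extracted amount, checking that it is grounded in the source text, storing it, and executing a generated SQL statement in read-only fashion.

```python
import re
import sqlite3
from dataclasses import dataclass


@dataclass
class KPI:
    # Hypothetical record shape; the paper's actual schema is not given here.
    company: str
    period: str
    metric: str
    value_musd: float  # standardized to millions of USD
    source_text: str   # sentence the value was extracted from


_SCALE = {"thousand": 1e-3, "million": 1.0, "billion": 1e3}


def normalize_value(raw: str) -> float:
    """Standardize strings like '$1.2 billion' to a float in millions of USD."""
    m = re.search(r"([\d,]+(?:\.\d+)?)\s*(thousand|million|billion)", raw.lower())
    if m is None:
        raise ValueError(f"unrecognized amount: {raw!r}")
    return float(m.group(1).replace(",", "")) * _SCALE[m.group(2)]


def is_grounded(kpi: KPI, raw_amount: str) -> bool:
    """Cheap verification: the raw amount must literally appear in the source."""
    return raw_amount in kpi.source_text


def store(conn: sqlite3.Connection, kpis: list[KPI]) -> None:
    conn.execute(
        "CREATE TABLE IF NOT EXISTS kpi "
        "(company TEXT, period TEXT, metric TEXT, value_musd REAL)"
    )
    conn.executemany(
        "INSERT INTO kpi VALUES (?, ?, ?, ?)",
        [(k.company, k.period, k.metric, k.value_musd) for k in kpis],
    )


def run_generated_sql(conn: sqlite3.Connection, sql: str) -> list[tuple]:
    """Execute SQL from a Text-to-SQL step, rejecting anything but SELECT."""
    if not sql.lstrip().lower().startswith("select"):
        raise ValueError("only SELECT statements are allowed")
    return conn.execute(sql).fetchall()


def demo() -> list[tuple]:
    # Made-up example sentence; in the described system an LLM would
    # produce both the extracted fields and the SQL string below.
    text = "Acme reported Q1 2025 revenue of $1.2 billion."
    raw = "$1.2 billion"
    kpi = KPI("Acme", "Q1 2025", "revenue", normalize_value(raw), text)
    if not is_grounded(kpi, raw):
        raise ValueError("extracted amount not found in source text")
    conn = sqlite3.connect(":memory:")
    store(conn, [kpi])
    return run_generated_sql(
        conn,
        "SELECT value_musd FROM kpi WHERE company = 'Acme' AND metric = 'revenue'",
    )
```

Calling `demo()` runs the whole path on the made-up sentence. Keeping the extracted values in a fixed unit and restricting generated SQL to `SELECT` are design choices of this sketch, not details confirmed by the paper.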
Chanyeol Choi
LinqAlpha, New York, NY, USA
Alejandro Lopez-Lira
Assistant Professor of Finance, University of Florida
Fintech, Machine Learning, Asset Pricing, Macro Finance, Private Equity
Jihoon Kwon
Seoul National University / Hanwha systems
Radar signal processing, Radar machine learning, Tracking filter, Microwave applications
Minjae Kim
LinqAlpha, New York, NY, USA
Juneha Hwang
LinqAlpha, New York, NY, USA
Minsoo Ha
LinqAlpha, New York, NY, USA
Chaewoon Kim
LinqAlpha, New York, NY, USA
Jaeseon Ha
LinqAlpha
AI, fundamental research
Suyeol Yun
LinqAlpha, New York, NY, USA
Jin Kim
LinqAlpha, New York, NY, USA