Advancing ESG Intelligence: An Expert-level Agent and Comprehensive Benchmark for Sustainable Finance

📅 2026-01-13

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses the challenges posed by fragmented unstructured data and the need for rigorous, multi-step auditing in ESG (Environmental, Social, and Governance) analysis, which current large language models struggle to handle effectively. To this end, we propose ESGAgent—the first expert-level, hierarchical multi-agent framework tailored for sustainable finance—that integrates retrieval-augmented generation, real-time web search, and domain-specific tools to support end-to-end analysis, ranging from atomic-level question answering to comprehensive report generation. We also introduce the first three-tier ESG capability evaluation benchmark, spanning foundational comprehension to professional reporting. Experimental results demonstrate that ESGAgent achieves an average accuracy of 84.15% on atomic tasks, significantly outperforming leading closed-source models, and is capable of producing professional-grade reports complete with visualizations and verifiable citations.

Technology Category

Application Category

📝 Abstract

Environmental, social, and governance (ESG) criteria are essential for evaluating corporate sustainability and ethical performance. However, professional ESG analysis is hindered by data fragmentation across unstructured sources, and existing large language models (LLMs) often struggle with the complex, multi-step workflows required for rigorous auditing. To address these limitations, we introduce ESGAgent, a hierarchical multi-agent system empowered by a specialized toolset, including retrieval augmentation, web search and domain-specific functions, to generate in-depth ESG analysis. Complementing this agentic system, we present a comprehensive three-level benchmark derived from 310 corporate sustainability reports, designed to evaluate capabilities ranging from atomic common-sense questions to the generation of integrated, in-depth analysis. Empirical evaluations demonstrate that ESGAgent outperforms state-of-the-art closed-source LLMs with an average accuracy of 84.15% on atomic question-answering tasks, and excels in professional report generation by integrating rich charts and verifiable references. These findings confirm the diagnostic value of our benchmark, establishing it as a vital testbed for assessing general and advanced agentic capabilities in high-stakes vertical domains.

Problem

Research questions and friction points this paper is trying to address.

ESG analysis

data fragmentation

large language models

multi-step workflows

sustainable finance

Innovation

Methods, ideas, or system contributions that make the work stand out.

ESGAgent

multi-agent system

retrieval-augmented generation

sustainable finance benchmark

domain-specific LLM evaluation

🔎 Similar Papers

No similar papers found.

Authors to Follow