EulerESG: Automating ESG Disclosure Analysis with LLMs

📅 2025-11-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
ESG reports are predominantly published as unstructured, format-heterogeneous PDFs, impeding standardized information extraction and cross-report alignment. To address this, we propose the first method that explicitly incorporates disclosure frameworks—such as the Sustainability Accounting Standards Board (SASB)—into large language model (LLM)-based analysis via a dual-channel retrieval-augmented system: one channel performs semantic retrieval using a vector database, while the other leverages natural language inference for precise standard clause alignment. Our system supports automated metric population, cross-firm benchmarking, and interactive exploration. Integrated end-to-end, it combines information extraction, interactive visualization dashboards, and conversational querying. Evaluated on ESG reports from four global corporations across 12 SASB sub-industries, it achieves a 0.95 average accuracy. The implementation—including source code and an interactive demo—is publicly released.

Technology Category

Application Category

📝 Abstract
Environmental, Social, and Governance (ESG) reports have become central to how companies communicate climate risk, social impact, and governance practices, yet they are still published primarily as long, heterogeneous PDF documents. This makes it difficult to systematically answer seemingly simple questions. Existing tools either rely on brittle rule-based extraction or treat ESG reports as generic text, without explicitly modelling the underlying reporting standards. We present extbf{EulerESG}, an LLM-powered system for automating ESG disclosure analysis with explicit awareness of ESG frameworks. EulerESG combines (i) dual-channel retrieval and LLM-driven disclosure analysis over ESG reports, and (ii) an interactive dashboard and chatbot for exploration, benchmarking, and explanation. Using four globally recognised companies and twelve SASB sub-industries, we show that EulerESG can automatically populate standard-aligned metric tables with high fidelity (up to 0.95 average accuracy) while remaining practical in end-to-end runtime, and we compare several recent LLM models in this setting. The full implementation, together with a demonstration video, is publicly available at https://github.com/UNSW-database/EulerESG.
Problem

Research questions and friction points this paper is trying to address.

Automates analysis of lengthy ESG reports using LLMs with framework awareness
Addresses limitations of rule-based extraction and generic text processing methods
Enables systematic disclosure analysis through retrieval and interactive exploration tools
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-powered system for ESG disclosure analysis
Dual-channel retrieval and LLM-driven analysis
Interactive dashboard and chatbot for exploration
🔎 Similar Papers
No similar papers found.
Y
Yi Ding
UNSW Sydney
X
Xushuo Tang
UNSW Sydney
Z
Zhengyi Yang
UNSW Sydney
Wenqian Zhang
Wenqian Zhang
UNSW Sydney
S
Simin Wu
Eigenflow AI
Yuxin Huang
Yuxin Huang
Unknown affiliation
L
Lingjing Lan
UNSW Sydney
Weiyuan Li
Weiyuan Li
Alibaba Group
RLLLMAgent
Yin Chen
Yin Chen
Lecturer in Mathematics at University of Saskatchewan
Invariant theoryLie theoryCommutative algebraApplied algebraic geometry
M
Mingchen Ju
UNSW Sydney
W
Wenke Yang
UNSW Sydney
T
Thong Hoang
Data61, CSIRO
M
Mykhailo Klymenko
Data61, CSIRO
X
Xiwei Xu
Data61, CSIRO
W
Wenjie Zhang
UNSW Sydney