EmeraldMind: A Knowledge Graph-Augmented Framework for Greenwashing Detection

📅 2025-12-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of automating greenwashing detection—i.e., identifying misleading corporate sustainability claims. We propose the first fact-centered detection framework, built upon EmeraldGraph, a domain-specific knowledge graph for ESG (Environmental, Social, and Governance) data. Our method integrates structured extraction from multi-source ESG reports, retrieval-augmented generation (RAG), and zero-shot large language model (LLM) reasoning—requiring no fine-tuning for claim verification. Key contributions include: (1) a verification paradigm anchored in verifiable facts; (2) an evidence-driven, interpretable decision mechanism; and (3) a transparent decision process supporting justified abstention. Evaluated on a novel greenwashing benchmark dataset, our approach significantly outperforms general-purpose LMs in accuracy, coverage, and explanation quality—bridging critical gaps in domain knowledge integration and explainability for sustainable claim validation.

Technology Category

Application Category

📝 Abstract
As AI and web agents become pervasive in decision-making, it is critical to design intelligent systems that not only support sustainability efforts but also guard against misinformation. Greenwashing, i.e., misleading corporate sustainability claims, poses a major challenge to environmental progress. To address this challenge, we introduce EmeraldMind, a fact-centric framework integrating a domain-specific knowledge graph with retrieval-augmented generation to automate greenwashing detection. EmeraldMind builds the EmeraldGraph from diverse corporate ESG (environmental, social, and governance) reports, surfacing verifiable evidence, often missing in generic knowledge bases, and supporting large language models in claim assessment. The framework delivers justification-centric classifications, presenting transparent, evidence-backed verdicts and abstaining responsibly when claims cannot be verified. Experiments on a new greenwashing claims dataset demonstrate that EmeraldMind achieves competitive accuracy, greater coverage, and superior explanation quality compared to generic LLMs, without the need for fine-tuning or retraining.
Problem

Research questions and friction points this paper is trying to address.

Detects misleading corporate sustainability claims automatically
Integrates domain-specific knowledge graphs with retrieval-augmented generation
Provides transparent, evidence-backed verdicts for greenwashing assessment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates domain-specific knowledge graph with retrieval-augmented generation
Builds evidence graph from corporate ESG reports for verification
Delivers transparent, evidence-backed classifications without model retraining
G
Georgios Kaoukis
Archimedes, Athena Research Center, Greece
I
Ioannis Aris Koufopoulos
Archimedes, Athena Research Center, Greece
P
Psaroudaki Eleni
Archimedes, Athena Research Center, Greece
D
Danae Pla Karidi
Archimedes, Athena Research Center, Greece
Evaggelia Pitoura
Evaggelia Pitoura
University of Ioannina, Greece
Data management
George Papastefanatos
George Papastefanatos
Research Director, Athena Research Center
Data ManagementVisual AnalyticsCloud ComputingData PipelinesData evolution
Panayiotis Tsaparas
Panayiotis Tsaparas
Archimedes, Athena Research Center, Greece; University of Ioannina, Greece