Polarity Detection of Sustainable Detection Goals in News Text

📅 2025-09-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper introduces SDG Polarity Detection, a novel task aimed at automatically identifying the directional impact—positive, neutral, or negative—of United Nations Sustainable Development Goals (SDGs) mentioned in news texts. To support this fine-grained semantic analysis, we construct SDG-POD, the first fully annotated benchmark dataset. We further pioneer the investigation of synthetic data augmentation for this task. Experiments systematically compare zero-shot inference and supervised fine-tuning across six state-of-the-art large language models. Results reveal limited zero-shot capability of current LLMs; however, fine-tuned QWQ-32B achieves significant gains over baselines across multiple SDG categories. Crucially, integrating real and synthetic data consistently improves macro-F1 by 4.2–8.7 percentage points. This work establishes a new task formulation, provides a foundational benchmark resource, and demonstrates an empirically validated pathway toward automated, polarity-aware analysis of sustainability-related text.

Technology Category

Application Category

📝 Abstract
The United Nations' Sustainable Development Goals (SDGs) provide a globally recognised framework for addressing critical societal, environmental, and economic challenges. Recent developments in natural language processing (NLP) and large language models (LLMs) have facilitated the automatic classification of textual data according to their relevance to specific SDGs. Nevertheless, in many applications, it is equally important to determine the directionality of this relevance; that is, to assess whether the described impact is positive, neutral, or negative. To tackle this challenge, we propose the novel task of SDG polarity detection, which assesses whether a text segment indicates progress toward a specific SDG or conveys an intention to achieve such progress. To support research in this area, we introduce SDG-POD, a benchmark dataset designed specifically for this task, combining original and synthetically generated data. We perform a comprehensive evaluation using six state-of-the-art large LLMs, considering both zero-shot and fine-tuned configurations. Our results suggest that the task remains challenging for the current generation of LLMs. Nevertheless, some fine-tuned models, particularly QWQ-32B, achieve good performance, especially on specific Sustainable Development Goals such as SDG-9 (Industry, Innovation and Infrastructure), SDG-12 (Responsible Consumption and Production), and SDG-15 (Life on Land). Furthermore, we demonstrate that augmenting the fine-tuning dataset with synthetically generated examples yields improved model performance on this task. This result highlights the effectiveness of data enrichment techniques in addressing the challenges of this resource-constrained domain. This work advances the methodological toolkit for sustainability monitoring and provides actionable insights into the development of efficient, high-performing polarity detection systems.
Problem

Research questions and friction points this paper is trying to address.

Detecting polarity of text relevance to Sustainable Development Goals
Assessing whether text indicates positive or negative SDG impact
Evaluating LLM performance on SDG polarity classification task
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposes SDG polarity detection task for text analysis
Introduces SDG-POD benchmark dataset with synthetic data
Evaluates LLMs with fine-tuning and data augmentation techniques
🔎 Similar Papers
No similar papers found.
A
Andrea Cadeddua
Linkalab s.r.l., Viale Elmas, 142, Cagliari, 09122, Italy
A
Alessandro Chessa
Linkalab s.r.l., Viale Elmas, 142, Cagliari, 09122, Italy
V
Vincenzo De Leo
Linkalab s.r.l., Viale Elmas, 142, Cagliari, 09122, Italy; Department of Mathematics and Computer Science, University of Cagliari, via Ospedale 72, Cagliari, 09124, Italy
G
Gianni Fenu
Department of Mathematics and Computer Science, University of Cagliari, via Ospedale 72, Cagliari, 09124, Italy
Francesco Osborne
Francesco Osborne
KMi, The Open University
Science of ScienceInformation ExtractionKnowledge GraphsArtificial IntelligenceSemantic Web
Diego Reforgiato Recupero
Diego Reforgiato Recupero
University of Cagliari
Semantic WebArtificial IntelligenceSentiment AnalysisInformation ExtractionBig Data
A
Angelo Salatino
Knowledge Media Institute, The Open University, Walton Hall, Kents Hill, Milton Keynes, MK76AA, United Kingdom
L
Luca Secchi
Linkalab s.r.l., Viale Elmas, 142, Cagliari, 09122, Italy