The CLEF-2025 CheckThat! Lab: Subjectivity, Fact-Checking, Claim Normalization, and Retrieval

📅 2025-03-19
📈 Citations: 0
✨ Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenges of detecting online disinformation and manipulative content—and ensuring interpretability—in multilingual, multi-platform environments. Methodologically, it introduces the first unified multi-task evaluation benchmark integrating subjectivity detection, claim standardization, numeric fact-checking, and scientific network discourse processing. It proposes an end-to-end interpretable fact-checking pipeline by systematically unifying auxiliary verification tasks—including subjectivity analysis and claim standardization—for the first time. Technically, it combines multilingual NLP, fine-grained span-level classification, semantic retrieval, and normalized semantic modeling. Key contributions include: (1) a cross-lingual, cross-task unified evaluation framework; (2) significant improvements in claim understanding and evidence alignment performance; and (3) enhanced robustness and interpretability of fact-checking systems—establishing a reusable technical foundation for global disinformation governance.

Technology Category

Application Category

📝 Abstract
The CheckThat! lab aims to advance the development of innovative technologies designed to identify and counteract online disinformation and manipulation efforts across various languages and platforms. The first five editions focused on key tasks in the information verification pipeline, including check-worthiness, evidence retrieval and pairing, and verification. Since the 2023 edition, the lab has expanded its scope to address auxiliary tasks that support research and decision-making in verification. In the 2025 edition, the lab revisits core verification tasks while also considering auxiliary challenges. Task 1 focuses on the identification of subjectivity (a follow-up from CheckThat! 2024), Task 2 addresses claim normalization, Task 3 targets fact-checking numerical claims, and Task 4 explores scientific web discourse processing. These tasks present challenging classification and retrieval problems at both the document and span levels, including multilingual settings.
Problem

Research questions and friction points this paper is trying to address.

Identify and counteract online disinformation across languages and platforms.
Address subjectivity, claim normalization, and fact-checking numerical claims.
Explore scientific web discourse processing in multilingual settings.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Identify subjectivity in multilingual online content
Normalize claims for consistent fact-checking analysis
Process scientific web discourse for verification
🔎 Similar Papers
No similar papers found.
F
Firoj Alam
Qatar Computing Research Institute, HBKU, Doha, Qatar
J
Julia Maria Struss
University of Applied Sciences Potsdam, Potsdam, Germany
T
Tanmoy Chakraborty
Indian Institute of Technology Delhi, New Delhi, India
Stefan Dietze
Stefan Dietze
Full Professor (Heinrich-Heine-University DĂźsseldorf) & Scientific Director (KTS, GESIS)
Knowledge GraphsInformation RetrievalWeb ScienceNLP
Stefan Dietze
Stefan Dietze
Full Professor (Heinrich-Heine-University DĂźsseldorf) & Scientific Director (KTS, GESIS)
Knowledge GraphsInformation RetrievalWeb ScienceNLP
S
Salim Hafid
University of Montpellier, LIRMM, CNRS, Montpellier, France
Katerina Korre
Katerina Korre
Postdoc researcher@Archimedes Research Unit
Arianna Muti
Arianna Muti
Postdoctoral Researcher at Bocconi University
Natural Language ProcessingComputational Social ScienceHate Speech
Preslav Nakov
Preslav Nakov
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)
Computational LinguisticsLarge Language ModelsFact-checkingFake News
Federico Ruggeri
Federico Ruggeri
University of Bologna
Natural language processingdeep learningargumentation miningknowledge integration
S
Sebastian Schellhammer
GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany
Vinay Setty
Vinay Setty
Associate Professor, University of Stavanger
Fact-checkingInformation RetrievalLarge Language Models
Megha Sundriyal
Megha Sundriyal
Max Planck Institute for Security and Privacy, Germany
NLPAutomated Fact-CheckingComputational Social SciencesArgumentation Mining
Konstantin Todorov
Konstantin Todorov
UniversitĂŠ Montpellier, LIRMM
AIneurosymbolic AIknowledge graphsmachine learningsemantic technologies
V
V Venktesh
Delft University of Technology, The Netherlands