Word2winners at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval

📅 2025-03-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses SemEval 2025 Task 7—multilingual fact-checking retrieval—by retrieving semantically matching, human-verified fact checks for given social media claims from the multilingual MultiClaim dataset. To overcome zero-shot cross-lingual retrieval bottlenecks, we propose a supervised fine-tuning framework integrating machine translation: leveraging mBERT or XLM-R as backbones, we introduce back-translation–enhanced cross-lingual alignment and bilingual collaborative optimization. Our method simultaneously improves retrieval accuracy in both monolingual and cross-lingual settings, achieving 92% and 85% accuracy, respectively—substantially outperforming baselines and ranking among the top systems on the official leaderboard. The core contribution lies in explicitly modeling translation fidelity within the retrieval fine-tuning process, thereby unifying semantic alignment and domain adaptation into a single optimization objective.

Technology Category

Application Category

📝 Abstract
This paper describes our system for SemEval 2025 Task 7: Previously Fact-Checked Claim Retrieval. The task requires retrieving relevant fact-checks for a given input claim from the extensive, multilingual MultiClaim dataset, which comprises social media posts and fact-checks in several languages. To address this challenge, we first evaluated zero-shot performance using state-of-the-art English and multilingual retrieval models and then fine-tuned the most promising systems, leveraging machine translation to enhance crosslingual retrieval. Our best model achieved an accuracy of 85% on crosslingual data and 92% on monolingual data.
Problem

Research questions and friction points this paper is trying to address.

Retrieve relevant fact-checks for claims
Use multilingual and crosslingual retrieval models
Achieve high accuracy on crosslingual and monolingual data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Utilized multilingual retrieval models
Fine-tuned with machine translation
Achieved high crosslingual accuracy
🔎 Similar Papers
No similar papers found.
A
Amirmohammad Azadi
Iran University of Science and Technology
S
Sina Zamani
Iran University of Science and Technology
M
Mohammadmostafa Rostamkhani
Iran University of Science and Technology
Sauleh Eetemadi
Sauleh Eetemadi
University of Birmingham Dubai, Microsoft Research, Michigan State University
Artificial IntelligenceMachine LearningNLPMachine TranslationActive Learning