YEZE at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization via Heterogeneous Ensembling

📅 2026-05-07
📈 Citations: 0
Influential: 0
📄 PDF

career value

192K/year
🤖 AI Summary
This study addresses the challenge of identifying polarizing content in multilingual, multicultural social media by jointly tackling three subtasks—binary polarization detection, target classification, and manifestation identification—across 22 languages within the SemEval-2026 Task 9 framework. The authors propose an ensemble framework based on heterogeneous multilingual models (XLM-RoBERTa-large and mDeBERTa-v3-base), integrating task-specific modeling, class-weighting strategies, and translation-based data augmentation to effectively mitigate severe label sparsity and class imbalance. Experimental results demonstrate that the proposed approach achieves strong performance across all three subtasks, confirming its effectiveness and robustness in cross-lingual polarization detection.
📝 Abstract
This paper presents our system for SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization, which identifies polarized social media content in 22 languages through three subtasks: binary detection, target classification, and manifestation identification. We propose a heterogeneous ensemble of multilingual pretrained models, combining XLM-RoBERTa-large and mDeBERTa-v3-base. We investigate techniques such as multi-task learning, translation-based data augmentation, and class weighting to improve classification performance under severe label imbalance. Our findings indicate that independent task modeling combined with class weighting is more effective.
Problem

Research questions and friction points this paper is trying to address.

online polarization
multilingual
multicultural
multievent
social media
Innovation

Methods, ideas, or system contributions that make the work stand out.

heterogeneous ensembling
multilingual polarization detection
class weighting
translation-based data augmentation
pretrained language models
🔎 Similar Papers
No similar papers found.