GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human

📅 2025-01-19
🤖 AI Summary
This study addresses the challenge of detecting AI-generated content (AIGC) through a shared task on binary classification of machine-generated text, run at the GenAI workshop at COLING 2025. The evaluation uses two tracks, English (monolingual) and multilingual, under a reproducible, comparable, and openly accessible benchmark. The baseline system integrates state-of-the-art techniques, including supervised classifiers based on pretrained language models, zero-shot and few-shot prompting, feature distillation, and ensemble learning. Key contributions include: (1) a high-quality, manually annotated multilingual dataset; (2) standardized evaluation protocols; and (3) publicly released baseline systems. During the test phase, 36 teams made official submissions to the English track and 26 to the multilingual track; the top-performing system achieved F1 scores of 0.92 (English) and 0.85 (multilingual), substantially outperforming baselines and advancing the standardization and cross-lingual generalization of AIGC detection.
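For readers unfamiliar with the F1 scores quoted in the summary, the following is a minimal sketch of how F1 is computed for a binary detector. It is not the task's official scorer: the function name `binary_f1`, the label convention (1 = machine-generated, 0 = human-written), and the toy labels are all assumptions made for illustration, and the exact F1 variant used for the official ranking is not stated on this page.

```python
def binary_f1(gold, pred, positive=1):
    """F1 for the positive class, given parallel lists of gold and predicted labels."""
    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both 0 (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data; assumed convention: 1 = machine-generated, 0 = human-written.
gold = [1, 1, 1, 0, 0, 0]
pred = [1, 1, 0, 0, 0, 1]
score = binary_f1(gold, pred)  # 2 true positives, 1 false positive, 1 false negative
```

F1 is the harmonic mean of precision and recall, so a detector cannot score well by labeling everything as machine-generated or everything as human-written.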

📝 Abstract
We present the GenAI Content Detection Task 1, a shared task on binary machine-generated text detection, conducted as part of the GenAI workshop at COLING 2025. The task consists of two subtasks: Monolingual (English) and Multilingual. The shared task attracted many participants: 36 teams made official submissions to the Monolingual subtask during the test phase, and 26 teams to the Multilingual subtask. We provide a comprehensive overview of the data, a summary of the results (including system rankings and performance scores), detailed descriptions of the participating systems, and an in-depth analysis of submissions.
https://github.com/mbzuai-nlp/COLING-2025-Workshop-on-MGT-Detection-Task1
Problem

Research questions and friction points this paper is trying to address.

Machine-written Text Detection
English and Other Languages
AI vs Human Writing
Innovation

Methods, ideas, or system contributions that make the work stand out.

GenAI Content Detection
Multilingual Machine-generated Text
Scientific Task
Authors

Yuxia Wang
MBZUAI
Natural Language Processing

Artem Shelmanov
MBZUAI
uncertainty estimation, fairness, active learning, nlp, deep learning

Jonibek Mansurov
PhD student in NLP, MBZUAI
NLP

Akim Tsvigun
Senior ML Architect @ Nebius AI
Natural Language Processing, Machine Learning, Active Learning

Vladislav Mikhailov
University of Oslo
LLM, NLP, benchmarking

Rui Xing
University of Melbourne
Natural Language Processing, Artificial Intelligence, Deep Learning

Zhuohan Xie
MBZUAI
Financial AI, Reasoning, Natural Language Processing, Computational Linguistics, Deep Learning

Jiahui Geng
Mohamed bin Zayed University of Artificial Intelligence
Artificial Intelligence, Natural Language Processing

Giovanni Puccetti
ISTI-CNR

Ekaterina Artemova
Toloka.AI, ex-HSE, ex-LMU
natural language processing, benchmarking, large language models

Jinyan Su
Cornell University
LLM reasoning, LLM agent, retrieval augmented generation

Minh Ngoc Ta
BKAI Research Center, Hanoi University of Science and Technology

Mervat Abassy
Alexandria University

Kareem Elozeiri
Zewail City of Science and Technology

Saad El Dine Ahmed
Alexandria University

Maiya Goloburda
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)
Uncertainty Quantification, Trustworthy NLP, LLM Safety, Low-resource NLP

Tarek Mahmoud
MBZUAI

Raj Vardhan Tomar
Cluster Innovation Center, University of Delhi

Nurkhan Laiyk
MBZUAI
NLP

Osama Mohammed Afzal
MBZUAI

Ryuto Koike
Institute of Science Tokyo, MBZUAI

Alham Fikri Aji
MBZUAI, Monash Indonesia
Multilinguality, Low-resource NLP, Language Modeling, Machine Translation

Nizar Habash
Professor of Computer Science, New York University Abu Dhabi
Natural Language Processing, Computational Linguistics, Artificial Intelligence

Iryna Gurevych
Full Professor, TU Darmstadt; Adjunct Professor, MBZUAI, UAE; Affiliated Professor, INSAIT, Bulgaria
Natural Language Processing, Large Language Models, Artificial Intelligence

Preslav Nakov
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)
Computational Linguistics, Large Language Models, Fact-checking, Fake News