From Birdwatch to Community Notes, from Twitter to X: four years of community-based content moderation

📅 2025-10-10
🤖 AI Summary
This study systematically evaluates the effectiveness and evolutionary dynamics of X's Community Notes crowdsourced content moderation system. Addressing the lack of long-term, structured empirical data in prior work, we construct a large-scale, open-source dataset covering the program's first four years, featuring multilingual detection, fine-grained topic classification, URL semantic extraction, and monthly user collaboration networks. Methodologically, we integrate NLP, language identification, dynamic topic modeling, and complex network analysis, complemented by a systematic literature review. Our key contributions are: (1) releasing a structured four-year Community Notes dataset alongside a complete, reproducible analytical toolchain; (2) uncovering cross-lingual and cross-topical mechanisms of consensus formation and bias patterns in community moderation; and (3) establishing a replicable empirical framework and standardized research infrastructure for trustworthy platform content governance.

📝 Abstract
Community Notes (formerly known as Birdwatch) is the first large-scale crowdsourced content moderation initiative that was launched by X (formerly known as Twitter) in January 2021. As the Community Notes model gains momentum across other social media platforms, there is a growing need to assess its underlying dynamics and effectiveness. This Resource paper provides (a) a systematic review of the literature on Community Notes, and (b) a major curated dataset and accompanying source code to support future research on Community Notes. We parsed Notes and Ratings data from the first four years of the program and conducted language detection across all Notes. Focusing on English-language Notes, we extracted embedded URLs and identified discussion topics in each Note. Additionally, we constructed monthly interaction networks among the Contributors. Together with the literature review, these resources offer a robust foundation for advancing research on the Community Notes system.
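The URL extraction step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' released code: the regular expression, the function names `extract_urls` and `url_domain`, and the choice to strip trailing punctuation are all assumptions made for illustration.

```python
import re
from urllib.parse import urlparse

# Hypothetical pattern: match http(s) URLs up to whitespace or a closing
# bracket/quote. Real Note text may need a more careful tokenizer.
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+")


def extract_urls(note_text):
    """Return the URLs embedded in one Note's free text, in order."""
    urls = []
    for match in URL_PATTERN.findall(note_text):
        # Drop sentence punctuation that often trails a URL in prose.
        urls.append(match.rstrip(".,;:!?"))
    return urls


def url_domain(url):
    """Return the lower-cased host of a URL without a leading 'www.'."""
    host = urlparse(url).netloc.lower()
    return host[4:] if host.startswith("www.") else host
```

Reducing each URL to its domain is one simple way to aggregate the sources that Notes cite; the paper may use a richer representation.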
Problem

Research questions and friction points this paper is trying to address.

Evaluating effectiveness of crowdsourced content moderation systems
Analyzing dynamics of Community Notes program on social media
Providing datasets for research on community-based fact checking
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large-scale crowdsourced content moderation system
Parsed Notes and Ratings data over four years
Constructed monthly interaction networks among Contributors
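
One plausible way to build the monthly interaction networks among Contributors is sketched below. The page does not say how the authors define an interaction, so the construction here (a directed, weighted edge from each rater to the author of the rated Note, bucketed by the UTC month of the rating) is an assumption. The field names `noteId`, `noteAuthorParticipantId`, `raterParticipantId`, and `createdAtMillis` are assumed to match the public Community Notes data files, and `monthly_networks` is a hypothetical helper.

```python
from collections import Counter, defaultdict
from datetime import datetime, timezone


def month_key(created_at_millis):
    """Map a millisecond Unix timestamp to a 'YYYY-MM' label in UTC."""
    dt = datetime.fromtimestamp(created_at_millis / 1000, tz=timezone.utc)
    return dt.strftime("%Y-%m")


def monthly_networks(notes, ratings):
    """Build one weighted edge list per month.

    notes:   iterable of dicts with 'noteId' and 'noteAuthorParticipantId'
    ratings: iterable of dicts with 'noteId', 'raterParticipantId',
             and 'createdAtMillis'
    Returns {month: Counter({(rater, author): number_of_ratings})}.
    """
    author_of = {n["noteId"]: n["noteAuthorParticipantId"] for n in notes}
    networks = defaultdict(Counter)
    for r in ratings:
        author = author_of.get(r["noteId"])
        rater = r["raterParticipantId"]
        # Skip ratings of unknown Notes and self-ratings (assumed policy).
        if author is None or author == rater:
            continue
        networks[month_key(r["createdAtMillis"])][(rater, author)] += 1
    return dict(networks)
```

Each monthly `Counter` can be passed to a graph library as a weighted edge list for further network analysis.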
Saeedeh Mohammadi
School of Mathematics and Statistics, University College Dublin, Dublin, Ireland
Narges Chinichian
Institute for Theoretical Physics, Technical University of Berlin, Berlin, Germany
Hannah Doyal
SPICED Academy, Berlin, Germany
Kristina Skutilova
School of Computer Science, University College Dublin, Dublin, Ireland
Hao Cui
University of California, Irvine
Michele d'Errico
School of Social Sciences and Philosophy, Trinity College Dublin, Dublin, Ireland
Siobhán Grayson
School of Sociology, University College Dublin, Dublin, Ireland
T. Yasseri
School of Social Sciences and Philosophy, Trinity College Dublin, Dublin, Ireland