Cross-Platform Violence Detection on Social Media: A Dataset and Analysis

📅 2025-05-19
🏛️ Web Science Conference
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Cross-platform violent content detection is hindered by the scarcity of high-quality, fine-grained annotated datasets—particularly those covering subtypes such as political and sexual violence across multiple platforms. Method: We construct the first large-scale, manually annotated cross-platform violent threat dataset comprising 30,000 instances from Weibo, Twitter, and Reddit, supporting both binary classification and fine-grained multi-subtype recognition. We conduct supervised learning and cross-platform transfer evaluation to assess representational consistency. Contribution/Results: Empirical results demonstrate strong cross-platform consistency in violent content representations: models trained on a single platform achieve high accuracy when tested on others, and performance further improves when training on merged multi-source data. This challenges the “platform-isolated modeling” assumption and validates the semantic transferability of violent content representations. Our dataset and findings provide a critical empirical foundation and methodological validation for robust, generalizable cross-platform content safety governance.

Technology Category

Application Category

📝 Abstract
Violent threats remain a significant problem across social media platforms. Useful, high-quality data facilitates research into the understanding and detection of malicious content, including violence. In this paper, we introduce a cross-platform dataset of 30,000 posts hand-coded for violent threats and sub-types of violence, including political and sexual violence. To evaluate the signal present in this dataset, we perform a machine learning analysis with an existing dataset of violent comments from YouTube. We find that, despite originating from different platforms and using different coding criteria, we achieve high classification accuracy both by training on one dataset and testing on the other, and in a merged dataset condition. These results have implications for content-classification strategies and for understanding violent content across social media.
Problem

Research questions and friction points this paper is trying to address.

Detecting violent threats across different social media platforms
Creating a high-quality dataset for violence classification research
Evaluating cross-platform machine learning models for violence detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-platform dataset of 30,000 hand-coded posts
Machine learning analysis for violence detection
High classification accuracy across different platforms
Celia Chen
Celia Chen
University of Maryland
Scotty Beland
Scotty Beland
University of Maryland College Park
Archivesuser studiesinformation behaviordigital divide
I
Ingo Burghardt
University of Maryland College Park, MD
J
Jill Byczek
University of Maryland College Park, MD
W
William J. Conway
University of Maryland College Park, MD
E
Eric Cotugno
University of Maryland College Park, MD
S
Sadaf Davre
University of Maryland College Park, MD
M
Megan Fletcher
University of Maryland College Park, MD
R
Rajesh Kumar Gnanasekaran
University of Maryland College Park, MD
K
Kristin Hamilton
University of Maryland College Park, MD
J
Jordan Heustis
University of Maryland College Park, MD
T
Tanaya Jha
University of Maryland College Park, MD
Emily Klein
Emily Klein
Pew Charitable Trusts
Marine ecologycoupled human-natural systemsfisheries management and policyhistorical marine ecologydiversity and inclusi
H
Hayden Kramer
University of Maryland College Park, MD
A
Alex Leitch
University of Maryland College Park, MD
J
Jessica Perkins
University of Maryland College Park, MD
C
Casi Sherman
University of Maryland College Park, MD
C
Celia Sterrn
University of Maryland College Park, MD
L
Logan Stevens
University of Maryland College Park, MD
R
Rebecca Zarrella
University of Maryland College Park, MD
Jennifer Golbeck
Jennifer Golbeck
University of Maryland, College Park
social networkssocial mediatrustrecommender systems