🤖 AI Summary
Hate speech dataset construction faces methodological trade-offs, with prevailing practices often compromising reliability for operational convenience.
Method: We conduct a cross-dataset qualitative meta-analysis and methodological critique, systematically identifying twelve recurrent pitfalls. We develop a reproducible evaluation matrix spanning annotation transparency, value positioning, and contextual modelling. Drawing on Max Weber's theory of "ideal types", we propose a novel three-dimensional framework—value awareness, transparent annotation, and meta-methodological reflection—that integrates classical sociological theory into computational social science data methodology for the first time.
Contribution/Results: This work shifts hate speech research from empirically driven practice toward reflexive scholarship, enhancing dataset rigour, interpretability, and ethical accountability. The framework provides actionable guidance for constructing socially responsible, theoretically grounded, and methodologically transparent hate speech datasets.
📝 Abstract
The curation of hate speech datasets involves complex design decisions that balance competing priorities. This paper critically examines these methodological choices across a diverse range of datasets, highlighting common themes and practices and their implications for dataset reliability. Drawing on Max Weber's notion of ideal types, we argue for a reflexive approach to dataset creation, urging researchers to acknowledge their own value judgements during construction and thereby foster transparency and methodological rigour.