🤖 AI Summary
This study investigates whether platform content moderation can facilitate user deradicalization, specifically examining the impact of banning or quarantining Manosphere subreddits on engagement with “exredpill” recovery communities.
Method: Leveraging publicly available observational data, we employ an event-study design combined with a difference-in-differences framework to construct multidimensional activity metrics—including new-user acquisition and posting volume—before and after moderation interventions.
Contribution/Results: Hard moderation (i.e., subreddit bans) significantly increases new-user enrollment and posting activity in recovery communities, with effects exceeding those of major real-world events; soft moderation (i.e., quarantine) yields no statistically significant impact. Critically, no evidence of toxicity backlash—i.e., increased hostile behavior—was observed following moderation. This study provides the first causal empirical evidence that platform-level bans can serve as effective catalysts for online deradicalization, offering rigorous, policy-relevant insights for content governance and intervention design.
📝 Abstract
Online platforms have sanctioned individuals and communities associated with fringe movements linked to hate speech, violence, and terrorism, but can these sanctions contribute to the abandonment of these movements? Here, we investigate this question through the lens of exredpill, a recovery community on Reddit meant to help individuals leave movements within the Manosphere, a conglomerate of fringe Web based movements focused on men's issues. We conduct an observational study on the impact of sanctioning some of Reddit's largest Manosphere communities on the activity levels and user influx of exredpill, the largest associated recovery subreddit. We find that banning a related radical community positively affects participation in exredpill in the period following the ban. Yet, quarantining the community, a softer moderation intervention, yields no such effects. We show that the effect induced by banning a radical community is stronger than for some of the widely discussed real-world events related to the Manosphere and that moderation actions against the Manosphere do not cause a spike in toxicity or malicious activity in exredpill. Overall, our findings suggest that content moderation acts as a deradicalization catalyst.