Hiding in Plain Sight: Finding MAHA on Reddit

📅 2026-05-19
📈 Citations: 0
Influential: 0
📄 PDF

career value

197K/year
🤖 AI Summary
This study addresses the challenge of systematically identifying and structuring the diverse belief systems underlying the “Make America Healthy Again” (MAHA) movement from vast volumes of unstructured social media data. To this end, it proposes an integrated approach combining large-scale data collection, topic-aligned annotation, and contextual natural language modeling. Leveraging 19.4 million posts authored by 4 million Reddit users between 2020 and 2025, the work constructs the first fine-grained, context-aware structured dataset encompassing twelve distinct MAHA-related belief themes. This resource fills a critical gap in high-quality, structured data for MAHA research and provides a foundational, interdisciplinary basis for analyzing the movement’s discursive structures, diffusion mechanisms, community evolution, and linguistic behaviors.
📝 Abstract
Make America Healthy Again (MAHA) is a national health movement that encompasses a striking mix of beliefs, from broadly accepted concerns about good diet and exercise to controversial takes on organic and genetically modified food, childhood vaccination, science, and institutions. Various influencers and promoters of the MAHA movement on social media are scattered throughout the online space. Investigating the structure, discourse, and contagion of MAHA beliefs requires large-scale fine-grained digital footprints. Constructing structured data covering different MAHA themes from vast unstructured social media data is challenging. We introduce a Reddit dataset that spans six years (2020-2025), comprising 19.4M posts from 4M users. Containing the natural and thematic context of 12 MAHA-aligned beliefs, this dataset offers researchers from various domains the opportunity to study the dynamics of the MAHA movement, its structural and functional components, and the linguistic and behavioral patterns of its proponents.
Problem

Research questions and friction points this paper is trying to address.

MAHA
social media
structured data
belief dynamics
Reddit
Innovation

Methods, ideas, or system contributions that make the work stand out.

structured social media dataset
fine-grained digital footprints
belief theme annotation
online health movement
Reddit data mining