Reddit's Globalization over Twenty Years: Inferring Community Time Zone from Activity Timestamps

📅 2026-05-05
📈 Citations: 0
Influential: 0

📝 Abstract
Online communities are a global phenomenon, but assessing their actual geographical spread requires accurate and scalable measurement. We propose and evaluate methods that infer the time zone of online communities solely from their temporal activity patterns, requiring nothing beyond hourly activity counts. Grounding our approach in the well-established finding that posting rhythms encode circadian structure, we compare time-domain and frequency-domain methods against a parsimonious heuristic: that activity reaches its minimum around 4 a.m. local time. On Reddit, we show that the best-performing method achieves sub-30-minute accuracy, and that fewer than a thousand comments are sufficient to reach peak performance. The heuristic nearly matches the accuracy of the more complex methods, recovering the correct time zone within a one-hour margin on average. Estimates from this simple method correlate significantly with Reddit's actual geographic distribution; we validate its generalizability across communities organized around diverse cultural phenomena, from sports to finance, and apply it at scale to characterize the geographic evolution of Reddit from its founding to the present. Our method is portable across platforms and requires no user disclosure, making it a practical baseline for any study that must account for the geographic structure of online behavior.
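The 4 a.m. heuristic described in the abstract can be illustrated with a minimal sketch. It assumes the input is a list of 24 activity counts indexed by UTC hour; the function names, the wrapping of offsets into [-12, 12), and the first-harmonic variant are illustrative assumptions, not the authors' implementation. The second function is only one plausible example of a frequency-domain estimate: it reads the phase of the 24-hour Fourier component and places the trough half a day after the fitted peak.

```python
import math

# From the abstract: activity is assumed to bottom out around 4 a.m. local time.
TROUGH_LOCAL_HOUR = 4


def wrap_offset(hours):
    """Wrap an hour difference into the range [-12, 12)."""
    return (hours + 12) % 24 - 12


def offset_from_trough(counts_utc):
    """Time-domain heuristic: map the quietest UTC hour to 4 a.m. local.

    counts_utc is a hypothetical input: 24 counts, index = hour of day in UTC.
    Returns the estimated UTC offset in whole hours.
    """
    if len(counts_utc) != 24:
        raise ValueError("expected 24 hourly counts")
    trough_utc = min(range(24), key=lambda h: counts_utc[h])
    return wrap_offset(TROUGH_LOCAL_HOUR - trough_utc)


def offset_from_first_harmonic(counts_utc):
    """Illustrative frequency-domain variant (an assumption, not the paper's method).

    Fits a single 24-hour sinusoid via the first Fourier coefficient and
    treats the point 12 hours after the fitted peak as the trough.
    Returns a fractional UTC offset in hours.
    """
    if len(counts_utc) != 24:
        raise ValueError("expected 24 hourly counts")
    omega = 2 * math.pi / 24
    real = sum(c * math.cos(omega * h) for h, c in enumerate(counts_utc))
    imag = -sum(c * math.sin(omega * h) for h, c in enumerate(counts_utc))
    peak_utc = (-math.atan2(imag, real) / omega) % 24
    trough_utc = (peak_utc + 12) % 24
    return wrap_offset(TROUGH_LOCAL_HOUR - trough_utc)


if __name__ == "__main__":
    # Synthetic daily profile peaking at 21:00 UTC (so the trough is at 09:00 UTC).
    profile = [100 + 50 * math.cos(2 * math.pi * (h - 21) / 24) for h in range(24)]
    print(offset_from_trough(profile))
    print(offset_from_first_harmonic(profile))
```

On the synthetic profile, the trough falls at 09:00 UTC, so the heuristic maps it to an offset of -5 hours. Real hourly counts are noisy, so in practice some smoothing or a minimum sample size would likely be needed before taking the minimum.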
Problem

Research questions and friction points this paper is trying to address.

time zone inference
online communities
geographic spread
activity timestamps
circadian rhythms
Innovation

Methods, ideas, or system contributions that make the work stand out.

time zone inference
online communities
temporal activity patterns
circadian rhythms
geographic scalability