DEEP: A Discourse Evolution Engine for Predictions about Social Movements

📅 2025-11-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates the mechanisms by which critical events shape public opinion evolution in social movements—supporting UN Sustainable Development Goals. We propose DEEP, a Transformer-based probabilistic multivariate time-series forecasting framework that jointly models news and social media data to simultaneously predict both volume (e.g., article/post counts) and sentiment polarity of public discourse, while quantifying predictive uncertainty. Methodologically, DEEP innovatively integrates large-scale domain-specific corpus construction, fine-grained sentiment analysis, and cross-platform temporal joint training. Empirically validated on the #MeToo movement, DEEP accurately captures event-driven sentiment shifts and diffusion dynamics. We curate and publicly release a longitudinal dataset comprising 433K Reddit posts and 121K news articles. This work delivers an interpretable, confidence-aware, data-driven tool for social movement monitoring, editorial decision-making, and policy response.

Technology Category

Application Category

📝 Abstract
Numerous social movements (SMs) around the world help support the UN's Sustainable Development Goals (SDGs). Understanding how key events shape SMs is key to the achievement of the SDGs. We have developed SMART (Social Media Analysis & Reasoning Tool) to track social movements related to the SDGs. SMART was designed by a multidisciplinary team of AI researchers, journalists, communications scholars and legal experts. This paper describes SMART's transformer-based multivariate time series Discourse Evolution Engine for Predictions about Social Movements (DEEP) to predict the volume of future articles/posts and the emotions expressed. DEEP outputs probabilistic forecasts with uncertainty estimates, providing critical support for editorial planning and strategic decision-making. We evaluate DEEP with a case study of the #MeToo movement by creating a novel longitudinal dataset (433K Reddit posts and 121K news articles) from September 2024 to June 2025 that will be publicly released for research purposes upon publication of this paper.
Problem

Research questions and friction points this paper is trying to address.

Predict future article volume for social movements
Forecast emotional expressions in movement discourse
Provide uncertainty-aware probabilistic movement predictions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer-based multivariate time series analysis engine
Probabilistic forecasts with uncertainty estimates
Novel longitudinal dataset for movement prediction
🔎 Similar Papers
No similar papers found.
Valerio La Gatta
Valerio La Gatta
Northwestern University
disinformation miningexplainable AImultimodal machine learning
Marco Postiglione
Marco Postiglione
Postdoc, Northwestern University
Deep LearningNatural Language ProcessingGraph Analytics
J
Jeremy Gilbert
Medill School of Journalism, Media, Integrated Marketing Communications, Northwestern University
D
Daniel W. Linna Jr.
Northwestern Pritzker School of Law and Department of Computer Science, Northwestern University
M
Morgan Manella Greenfield
The Wall Street Journal
Aaron Shaw
Aaron Shaw
Associate Professor, Communication Studies, Northwestern University
Online communitiesPeer productionOrganizations
V
V. S. Subrahmanian
Department of Computer Science and Buffett Institute for Global Affairs, Northwestern University