Beyond the Lens: Quantifying the Impact of Scientific Documentaries through Amazon Reviews

📅 2025-02-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the lack of large-scale, quantitative methods for assessing public impact of science documentaries. We propose the first fine-grained impact taxonomy tailored to this domain and construct an open-source dataset comprising 1,296 human-annotated sentences drawn from 1,043 viewer comments on six popular Amazon science documentaries. Our methodology integrates machine learning with large language models to jointly identify impact categories—namely, sparking interest, enhancing understanding, and prompting action—and extract associated scientific themes, augmented by quantitative sentiment analysis and expert validation. Experimental results achieve a maximum F1-score of 0.89 and demonstrate strong cross-documentary generalization. Key contributions include: (1) the first fine-grained evaluation framework for science documentary impact; (2) the first publicly available, manually annotated benchmark dataset; and (3) a scalable hybrid modeling paradigm. This work establishes a new empirical foundation and methodological benchmark for evaluating science communication effectiveness.

Technology Category

Application Category

📝 Abstract
Engaging the public with science is critical for a well-informed population. A popular method of scientific communication is documentaries. Once released, it can be difficult to assess the impact of such works on a large scale, due to the overhead required for in-depth audience feedback studies. In what follows, we overview our complementary approach to qualitative studies through quantitative impact and sentiment analysis of Amazon reviews for several scientific documentaries. In addition to developing a novel impact category taxonomy for this analysis, we release a dataset containing 1296 human-annotated sentences from 1043 Amazon reviews for six movies created in whole or part by a team of visualization designers who focus on cinematic presentations of scientific data. Using this data, we train and evaluate several machine learning and large language models, discussing their effectiveness and possible generalizability for documentaries beyond those focused on for this work. Themes are also extracted from our annotated dataset which, along with our large language model analysis, demonstrate a measure of the ability of scientific documentaries to engage with the public.
Problem

Research questions and friction points this paper is trying to address.

Quantify impact of scientific documentaries
Analyze Amazon reviews for sentiment
Develop taxonomy for documentary impact
Innovation

Methods, ideas, or system contributions that make the work stand out.

quantitative impact analysis
sentiment analysis techniques
machine learning models
J
Jill Naiman
Aria Pessianzadeh
Aria Pessianzadeh
Drexel University
Computational social scienceNLPSocial media analysis
Hanyu Zhao
Hanyu Zhao
Alibaba Group
Distributed SystemsSystems for AI
A
AJ Christensen
A
Alistair Nunn
S
Shriya Srikanth
A
Anushka Gami
E
Emma Maxwell
L
Louisa Zhang
S
Sri Nithya Yeragorla
R
R. Rezapour