Zeitgeist-Aware Multimodal (ZAM) Datasets of Pro-Eating Disorder Short-Form Videos: An Idea Worth Researching

๐Ÿ“… 2026-04-21
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

215K/year
๐Ÿค– AI Summary
Existing approaches predominantly rely on textual signals and struggle to effectively capture the multimodal characteristics of pro-eating disorder content in short videos or its rapidly evolving cultural context. To address this gap, this work introduces the concept of โ€œzeitgeist-awareโ€ detection and presents ZAM, the first dynamically evolving, expert-annotated multimodal dataset for pro-eating disorder content. ZAM integrates multimodal analysis, expert annotation, adaptive inclusion criteria, and continuous data collection to enable real-time detection and research of short-form harmful content. By doing so, it fills a critical void in dynamic multimodal benchmarking and provides a scalable foundation for interdisciplinary research and responsive content moderation systems.

Technology Category

Application Category

๐Ÿ“ Abstract
Objective: Reliable identification of pro-eating disorder (pro-ED) content online suffers from two pervasive problems: 1) existing methods predominantly rely on text-based signals, failing to capture the inherently multimodal nature of multimedia content; and 2) these methods struggle to keep pace with the rapid evolution of references, memes, terminology, and contextual cues that underlie this content. Together, these limitations point to a gap: the absence of an expert-annotated reference standard capable of supporting real-time research and robust multimodal detection model training for pro-ED content on short-form video platforms. Method: To address this, we propose "zeitgeist-aware" multimodal (ZAM) datasets: continuously curated collections of annotated multimodal pro-ED content with inclusion criteria that evolve alongside the memetic zeitgeist: the variable essence of what is considered pro-ED as new media and references come into the cultural zeitgeist and are absorbed and interpreted in online spaces. Results: We present a rationale for such datasets, define their core characteristics, outline approaches for their curation, and describe our progress toward that end. Discussion: This dataset and pipeline architecture may benefit researchers across several fields who are interested in how pro-ED sentiment is encoded and transmitted through short-form video content across time, including for the purpose of responsive moderation efforts.
Problem

Research questions and friction points this paper is trying to address.

pro-eating disorder
multimodal content
zeitgeist-aware
short-form videos
dataset gap
Innovation

Methods, ideas, or system contributions that make the work stand out.

zeitgeist-aware
multimodal dataset
pro-eating disorder
short-form video
dynamic annotation
E
Eden Shaveet
Department of Information Science, Cornell University, New York, USA
Z
Zefan Sramek
Department of Electrical Engineering and Information Systems, The University of Tokyo, Tokyo, Japan | Department of Information and Communication Engineering, The University of Tokyo, Tokyo, Japan
Y
Yumi Hamamoto
Frontier Research Institute for Interdisciplinary Sciences (FRIS), Tohoku University, Sendai, Japan | Research Institute of Electrical Communication (RIEC), Tohoku University, Sendai, Japan
Jing Du
Jing Du
Postdoctoral Research Fellow at University of New South Wales
Recommender SystemBrain Network AnalysisDisease Spread PredicionTime-Series Analysis
S
Scott Griffiths
Melbourne School of Psychological Sciences, University of Melbourne, Melbourne, Australia
T
Thalia Zhang
Department of Information Science, Cornell University, New York, USA
Thalia Viranda
Thalia Viranda
Cornell Tech
mobile sensingdigital health interventionself-controlemotion regulationeating disorders
W
William Hornby
williamhornby.com, USA
Flora Salim
Flora Salim
Professor, CSE, UNSW
Machine LearningTime SeriesSpatiotemporalUbiCompFoundation Models
Koji Yatani
Koji Yatani
University of Tokyo
Human-Computer InteractionUbiquitous ComputingAI/IoT ApplicationsDigital HealthcareUsable Security
Tanzeem Choudhury
Tanzeem Choudhury
Professor, Computing and Information Science, Cornell Tech
digital healthubiquitous computingmobile systems