When Rules Fall Short: Agent-Driven Discovery of Emerging Content Issues in Short Video Platforms

📅 2026-01-14
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the rapidly evolving content risks on short-video platforms, where traditional manual detection mechanisms suffer from low efficiency and delayed policy updates, thereby limiting governance effectiveness. The authors propose the first automated problem discovery framework based on a multimodal large language model (LLM) agent. The approach identifies potentially violating videos, applies a two-stage clustering method to generate emerging issue clusters, and automatically produces executable annotation policies. This work pioneers the use of multimodal LLM agents for defining and detecting novel content risks, enabling rapid iteration of annotation strategies. Experimental results demonstrate that, compared to manual processes, the proposed method improves the F1 score for emerging issue detection by over 20%, reduces associated video views by approximately 15%, and significantly shortens the policy update cycle.

Technology Category

Application Category

📝 Abstract
Trends on short-video platforms evolve at a rapid pace, with new content issues emerging every day that fall outside the coverage of existing annotation policies. However, traditional human-driven discovery of emerging issues is too slow, which leads to delayed updates of annotation policies and poses a major challenge for effective content governance. In this work, we propose an automatic issue discovery method based on multimodal LLM agents. Our approach automatically recalls short videos containing potential new issues and applies a two-stage clustering strategy to group them, with each cluster corresponding to a newly discovered issue. The agent then generates updated annotation policies from these clusters, thereby extending coverage to these emerging issues. Our agent has been deployed in the real system. Both offline and online experiments demonstrate that this agent-based method significantly improves the effectiveness of emerging-issue discovery (with an F1 score improvement of over 20%) and enhances the performance of subsequent issue governance (reducing the view count of problematic videos by approximately 15%). More importantly, compared to manual issue discovery, it greatly reduces time costs and substantially accelerates the iteration of annotation policies.
Problem

Research questions and friction points this paper is trying to address.

emerging content issues
short video platforms
annotation policies
content governance
issue discovery
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal LLM agents
emerging issue discovery
two-stage clustering
annotation policy generation
short video content governance
🔎 Similar Papers
No similar papers found.
C
Chenghui Yu
TikTok Inc., San Jose, CA, USA
H
Hongwei Wang
TikTok Inc., San Jose, CA, USA
J
Junwen Chen
TikTok Inc., San Jose, CA, USA
Z
Zixuan Wang
TikTok Inc., San Jose, CA, USA
B
Bingfeng Deng
TikTok Inc., Singapore
Zhuolin Hao
Zhuolin Hao
Bytedance
Hongyu Xiong
Hongyu Xiong
Stanford University
Image ProcessingApplied Machine LearningComputer VisionSmart Manufacturing
Y
Yang Song
TikTok Inc., San Jose, CA, USA