Chameleon: On the Scene Diversity and Domain Variety of AI-Generated Videos Detection

📅 2025-03-09

📈 Citations: 0

✨ Influential: 0

career value

208K/year

🤖 AI Summary

Existing AI-generated video detection datasets suffer from severe limitations in diversity, authenticity, and complexity, hindering robust forensic analysis in open-world scenarios. To address this, we introduce Chameleon—a novel benchmark dataset that systematically integrates multiple generative models (e.g., SVD, Pika, Runway), diverse real-world source footage, dynamic camera transitions, and non-face-centric content (e.g., action and environmental generation), thereby departing from conventional face-centric detection paradigms. Chameleon leverages motion-aware modeling, cross-domain sampling, and dynamic editing techniques to construct a high-fidelity, generalizable benchmark for video forgery detection. Empirical evaluation demonstrates that models trained on Chameleon achieve significantly improved generalization across generative tools, real-world scenes, and action categories. As the first authoritative benchmark designed explicitly for complex, realistic AIGC video forensics, Chameleon establishes a new standard for evaluating detection robustness beyond controlled, face-focused settings.

Technology Category

Application Category

📝 Abstract

Artificial intelligence generated content (AIGC), known as DeepFakes, has emerged as a growing concern because it is being utilized as a tool for spreading disinformation. While much research exists on identifying AI-generated text and images, research on detecting AI-generated videos is limited. Existing datasets for AI-generated videos detection exhibit limitations in terms of diversity, complexity, and realism. To address these issues, this paper focuses on AI-generated videos detection and constructs a diverse dataset named Chameleon. We generate videos through multiple generation tools and various real video sources. At the same time, we preserve the videos' real-world complexity, including scene switches and dynamic perspective changes, and expand beyond face-centered detection to include human actions and environment generation. Our work bridges the gap between AI-generated dataset construction and real-world forensic needs, offering a valuable benchmark to counteract the evolving threats of AI-generated content.

Problem

Research questions and friction points this paper is trying to address.

Detecting AI-generated videos with diverse scenes and domains.

Addressing limitations in existing datasets for AI video detection.

Expanding detection beyond face-centered to include actions and environments.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Constructs diverse dataset named Chameleon

Generates videos using multiple tools and sources

Expands detection to human actions and environments

🔎 Similar Papers

Detecting AI-Generated Video via Frame Consistency

2024-02-03Citations: 1