Dormant: Defending against Pose-driven Human Image Animation

📅 2024-09-22
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To mitigate the misuse risk of single-image-driven pose animation—such as generating illegal content (e.g., politically sensitive or violent videos)—this paper proposes the first adversarial image protection method specifically designed for this task. Our approach injects imperceptible, minimal perturbations that significantly degrade animation quality while preserving the original image’s semantic integrity. Technically, we innovatively model two failure mechanisms: erroneous appearance feature extraction and inter-frame consistency disruption, ensuring robustness against white-box attacks and compatibility with black-box commercial APIs. We conduct comprehensive evaluations across eight state-of-the-art animation models, four benchmark datasets, and six commercial animation APIs. Results demonstrate consistent superiority over six baseline methods: generated animations exhibit perceptible failures—including identity misalignment, structural artifacts, and temporal incoherence—thereby effectively deterring unauthorized or malicious video generation.

Technology Category

Application Category

📝 Abstract
Pose-driven human image animation has achieved tremendous progress, enabling the generation of vivid and realistic human videos from just one single photo. However, it conversely exacerbates the risk of image misuse, as attackers may use one available image to create videos involving politics, violence and other illegal content. To counter this threat, we propose Dormant, a novel protection approach tailored to defend against pose-driven human image animation techniques. Dormant applies protective perturbation to one human image, preserving the visual similarity to the original but resulting in poor-quality video generation. The protective perturbation is optimized to induce misextraction of appearance features from the image and create incoherence among the generated video frames. Our extensive evaluation across 8 animation methods and 4 datasets demonstrates the superiority of Dormant over 6 baseline protection methods, leading to misaligned identities, visual distortions, noticeable artifacts, and inconsistent frames in the generated videos. Moreover, Dormant shows effectiveness on 6 real-world commercial services, even with fully black-box access.
Problem

Research questions and friction points this paper is trying to address.

Defending pose-driven image misuse
Protecting against illegal video creation
Ensuring video generation quality degradation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Protective perturbation to human images
Optimized for misextraction of features
Effective across multiple animation methods
J
Jiachen Zhou
Institute of Information Engineering, Chinese Academy of Sciences, China; School of Cyber Security, University of Chinese Academy of Sciences, China
Mingsi Wang
Mingsi Wang
Institute of Information Engineering, Chinese Academy of Sciences
AI security
Tianlin Li
Tianlin Li
Nanyang Technological University
AI4SESE4AITrustworthy AI
Guozhu Meng
Guozhu Meng
Associate Professor with Chinese Academy of Sciences
mobile securityprogram analysisAI privacy and security
K
Kai Chen
Institute of Information Engineering, Chinese Academy of Sciences, China; School of Cyber Security, University of Chinese Academy of Sciences, China