Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection

📅 2024-08-21
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address copyright infringement risks in 3D geometric reconstruction induced by multi-view diffusion models (MVDMs), this work proposes the first dual-erasure adversarial attack targeting latent features and cross-view attention. The method simultaneously perturbs latent-space feature distributions and masks cross-view attention weights, thereby disrupting both geometric and visual consistency across views and domains. Its key contributions are: (1) establishing the first differentiable, dual-dimensional attack paradigm specifically designed for MVDMs; (2) pioneering the explicit modeling and adversarial manipulation of cross-view attention to dismantle 3D structural consistency; and (3) integrating multi-domain consistency constraints to enhance transferability and robustness against defenses. Evaluated on state-of-the-art MVDMs, the attack achieves a success rate of 92.7%, an average cross-model transfer success rate of 86.4%, and maintains over 78% efficacy against strong defenses such as DiffPure.

Technology Category

Application Category

📝 Abstract
Multi-View Diffusion Models (MVDMs) enable remarkable improvements in the field of 3D geometric reconstruction, but the issue regarding intellectual property has received increasing attention due to unauthorized imitation. Recently, some works have utilized adversarial attacks to protect copyright. However, all these works focus on single-image generation tasks which only need to consider the inner feature of images. Previous methods are inefficient in attacking MVDMs because they lack the consideration of disrupting the geometric and visual consistency among the generated multi-view images. This paper is the first to address the intellectual property infringement issue arising from MVDMs. Accordingly, we propose a novel latent feature and attention dual erasure attack to disrupt the distribution of latent feature and the consistency across the generated images from multi-view and multi-domain simultaneously. The experiments conducted on SOTA MVDMs indicate that our approach achieves superior performances in terms of attack effectiveness, transferability, and robustness against defense methods. Therefore, this paper provides an efficient solution to protect 3D assets from MVDMs-based 3D geometry reconstruction.
Problem

Research questions and friction points this paper is trying to address.

Protect 3D assets from unauthorized MVDMs-based reconstruction
Disrupt latent feature distribution in multi-view diffusion models
Break geometric consistency across generated multi-view images
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual erasure attack on latent features
Disrupts multi-view image consistency
Effective against 3D reconstruction models
🔎 Similar Papers
No similar papers found.
J
Jingwei Sun
Institution of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, China
X
Xuchong Zhang
Institution of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, China
C
Changfeng Sun
Institution of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, China
Q
Qicheng Bai
Institution of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, China
Hongbin Sun
Hongbin Sun
Xi'an Jiaotong University
Computer ArchitectureVLSI Circuit