TarPro: Targeted Protection against Malicious Image Editing

📅 2025-03-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the security risk of image editing models being misused for generating NSFW content—where existing defenses struggle to simultaneously block malicious edits and preserve fidelity of legitimate edits—this paper proposes a semantic-aware selective protection mechanism. First, a semantic-constrained module is introduced to precisely identify potential adversarial editing intents. Second, a lightweight learnable perturbation generator is designed to apply differentiated interventions specifically on harmful editing pathways. Third, a robustness-optimized training strategy is incorporated to enhance generalization across diverse models and scenarios. Evaluated on multiple state-of-the-art editing models and real-world benchmarks, our method achieves a 96.2% malicious-edit blocking rate while incurring less than 1.3% fidelity degradation on benign edits—outperforming all prior approaches. To our knowledge, this is the first work to enable fine-grained, semantics-driven discrimination and protection of editing behaviors, establishing a new paradigm for secure and controllable image editing.

Technology Category

Application Category

📝 Abstract
The rapid advancement of image editing techniques has raised concerns about their misuse for generating Not-Safe-for-Work (NSFW) content. This necessitates a targeted protection mechanism that blocks malicious edits while preserving normal editability. However, existing protection methods fail to achieve this balance, as they indiscriminately disrupt all edits while still allowing some harmful content to be generated. To address this, we propose TarPro, a targeted protection framework that prevents malicious edits while maintaining benign modifications. TarPro achieves this through a semantic-aware constraint that only disrupts malicious content and a lightweight perturbation generator that produces a more stable, imperceptible, and robust perturbation for image protection. Extensive experiments demonstrate that TarPro surpasses existing methods, achieving a high protection efficacy while ensuring minimal impact on normal edits. Our results highlight TarPro as a practical solution for secure and controlled image editing.
Problem

Research questions and friction points this paper is trying to address.

Prevent malicious image edits while preserving normal edits
Balance between blocking harmful content and maintaining editability
Develop a robust and imperceptible image protection framework
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic-aware constraint blocks malicious edits
Lightweight perturbation generator ensures robust protection
Maintains normal editability while preventing harmful content
🔎 Similar Papers
No similar papers found.