🤖 AI Summary
Existing stealthy backdoor attacks rely on white-box or black-box model access or auxiliary data, limiting their practicality. To address this, we propose ReVeil, the first unconstrained stealthy backdoor attack injected during data collection, requiring neither model access nor additional data. Its core innovation is the integration of machine unlearning: before deployment, trigger samples induce unlearning that suppresses the attack success rate (ASR) to ≤6.5%, evading three major detection paradigms; after deployment, reversing the forgetting restores the ASR to over 92%. ReVeil combines trigger-pattern injection with robustness across datasets and trigger designs: we validate its efficacy on four benchmark datasets and four distinct trigger patterns. Crucially, ReVeil is the first attack to leverage machine unlearning synergistically for both backdoor activation and stealth, overcoming key practicality bottlenecks of stealthy backdoor attacks.
📄 Abstract
Backdoor attacks embed hidden functionality in deep neural networks (DNNs), triggering malicious behavior on specific inputs. Advanced defenses monitor anomalous DNN inferences to detect such attacks. However, concealed backdoors evade detection by maintaining a low attack success rate (ASR) before deployment and restoring a high ASR after deployment via machine unlearning. Existing concealed backdoors are constrained by requiring white-box or black-box model access or auxiliary data, limiting their practicality when such access or data is unavailable. This paper introduces ReVeil, a concealed backdoor attack targeting the data collection phase of the DNN training pipeline that requires no model access or auxiliary data. ReVeil maintains a low pre-deployment ASR across four datasets and four trigger patterns, successfully evades three popular backdoor detection methods, and restores a high ASR post-deployment through machine unlearning.
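Since ReVeil operates purely at the data collection stage, its entry point is ordinary data poisoning: stamping a trigger pattern onto a small fraction of collected samples and relabeling them to an attacker-chosen target class. The paper's actual trigger designs and its unlearning-based concealment mechanism are not reproduced here; the sketch below only illustrates generic trigger stamping, using a hypothetical 3×3 corner-patch trigger and a hypothetical `poison_dataset` helper.

```python
import numpy as np

def apply_trigger(image, trigger, mask):
    """Blend a trigger pattern into an image wherever mask is nonzero."""
    return image * (1 - mask) + trigger * mask

def poison_dataset(images, labels, target_label, rate=0.1, seed=0):
    """Poison a fraction `rate` of samples: stamp the trigger and relabel
    them to the attacker-chosen target class. Returns the poisoned copies
    and the indices that were modified."""
    rng = np.random.default_rng(seed)
    n = len(images)
    idx = rng.choice(n, size=int(rate * n), replace=False)
    # Hypothetical trigger: a 3x3 white patch in the bottom-right corner.
    trigger = np.zeros_like(images[0])
    mask = np.zeros_like(images[0])
    trigger[-3:, -3:] = 1.0
    mask[-3:, -3:] = 1.0
    poisoned = images.copy()
    new_labels = labels.copy()
    for i in idx:
        poisoned[i] = apply_trigger(poisoned[i], trigger, mask)
        new_labels[i] = target_label
    return poisoned, new_labels, idx
```

A victim who trains on the returned data would associate the corner patch with the target label; ReVeil's contribution lies in additionally keeping that association dormant (low ASR) until after deployment, which this sketch does not attempt to model.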