🤖 AI Summary
To address privacy leakage from model inversion (MI) attacks, this paper adapts Random Erasing (RE), originally a data augmentation technique, into a data-level privacy defense. During training, RE probabilistically erases random rectangular regions of input images, actively degrading the model's capacity to encode fine-grained private details. The method requires no architectural or loss-function changes, making it orthogonal to and compatible with existing defenses, and it eases the privacy–utility trade-off. Evaluated across 23 experimental settings, the approach achieves a state-of-the-art privacy–utility balance: reconstructed images suffer a substantial PSNR degradation of 12.6 dB while classification accuracy drops by less than 1.2%, consistently outperforming mainstream defense strategies in both privacy preservation and task performance.
📝 Abstract
Model Inversion (MI) is a type of privacy violation that focuses on reconstructing private training data through abusive exploitation of machine learning models. To defend against MI attacks, state-of-the-art (SOTA) defense methods rely on regularizations that conflict with the training loss, creating explicit tension between privacy protection and model utility. In this paper, we present a new method to defend against MI attacks. Our method takes a new perspective and focuses on the training data. It builds on a novel insight into Random Erasing (RE), which has previously been applied as a data augmentation technique to improve model accuracy under occlusion. In our work, we instead apply RE to degrade MI attack accuracy. Our key insight is that MI attacks require a significant amount of private training-data information encoded inside the model in order to reconstruct high-dimensional private images. Therefore, we propose applying RE to reduce the private information presented to the model during training. We show that this leads to substantial degradation in MI reconstruction quality and attack accuracy, while the natural accuracy of the model is only moderately affected. Our method is simple to implement and complementary to existing defenses. Extensive experiments across 23 setups demonstrate that our method achieves SOTA performance in balancing privacy and utility, and consistently outperforms existing defenses across different MI attacks, network architectures, and attack configurations.
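The erasing step at the heart of the defense can be sketched in plain Python. This is a minimal illustration of the standard Random Erasing formulation (random area ratio, random aspect ratio, retry until the box fits); the function name, defaults, and fill value here are illustrative assumptions, not the paper's exact configuration.

```python
import random

def random_erasing(img, p=0.5, area_ratio=(0.02, 0.33), aspect=(0.3, 3.3),
                   fill=0.0, rng=None):
    """Randomly erase one rectangular region of an H x W x C image
    (nested lists of floats) and return a modified copy.

    Hyperparameter names and defaults follow the common Random Erasing
    recipe; the paper's defense may tune them differently.
    """
    rng = rng or random.Random()
    h, w = len(img), len(img[0])
    out = [[list(px) for px in row] for row in img]  # deep copy, input untouched
    if rng.random() >= p:
        return out  # skip erasing with probability 1 - p
    for _ in range(10):  # resample until the box fits inside the image
        target_area = rng.uniform(*area_ratio) * h * w
        ar = rng.uniform(*aspect)
        eh = int(round((target_area * ar) ** 0.5))   # box height
        ew = int(round((target_area / ar) ** 0.5))   # box width
        if 0 < eh < h and 0 < ew < w:
            top = rng.randrange(h - eh)
            left = rng.randrange(w - ew)
            for i in range(top, top + eh):
                for j in range(left, left + ew):
                    out[i][j] = [fill] * len(out[i][j])  # blank the region
            return out
    return out  # no valid box sampled: image unchanged
```

In a training pipeline this would be applied per image, per epoch, so the model sees a different occlusion pattern on each pass and never observes every private pixel of a given sample in one view.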