Improving Transferability of Adversarial Examples via Bayesian Attacks

📅 2023-07-21

🏛️ IEEE Transactions on Circuits and Systems for Video Technology (Print)

📈 Citations: 2

✨ Influential: 0

🤖 AI Summary

To address the weak cross-model transferability of adversarial examples in black-box settings, this paper proposes a novel Bayesian randomization framework that jointly models the input space and the model parameter space, extending the authors' earlier ICLR work, which applied the Bayesian formulation to model parameters only. The method co-designs input posterior approximations and parameter priors, integrating variational inference with stochastic adversarial optimization. Its fine-tuning objective implicitly encourages flat minima in both the parameter space and the input space, which improves transferability. The approach requires no access to or fine-tuning of the target model. Evaluated on ImageNet and CIFAR-10, it raises the average transfer attack success rate over the ICLR baseline by 19.14% and 2.08%, respectively, setting a new state of the art for transfer-based attacks.

📝 Abstract

This paper presents a substantial extension of our work published at ICLR. Our ICLR work advocated enhancing the transferability of adversarial examples by incorporating a Bayesian formulation into the model parameters, which effectively emulates an ensemble of infinitely many deep neural networks. In this paper, we introduce a novel extension that incorporates the Bayesian formulation into the model input as well, enabling joint diversification of both the model input and the model parameters. Our empirical findings demonstrate that: 1) combining Bayesian formulations for both the model input and the model parameters yields significant improvements in transferability; 2) introducing advanced approximations of the posterior distribution over the model input further enhances adversarial transferability, surpassing all state-of-the-art methods when attacking without model fine-tuning. Moreover, we propose a principled approach to fine-tuning model parameters in this extended Bayesian formulation. The derived optimization objective inherently encourages flat minima in both the parameter space and the input space. Extensive experiments demonstrate that our method achieves a new state of the art on transfer-based attacks, improving the average success rate on ImageNet and CIFAR-10 by 19.14% and 2.08%, respectively, compared with our basic Bayesian method from the ICLR paper. We will make our code publicly available.
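The joint diversification described in the abstract can be illustrated with a small Monte Carlo sketch. This is not the authors' implementation: the toy linear classifier with logistic loss, the isotropic Gaussian sampling of both parameters and input, the sign-gradient update, and every name and default value (`bayesian_attack_step`, `sigma_w`, `sigma_x`, `step`, `eps`) are assumptions chosen for illustration. The paper itself reports using more advanced posterior approximations over the input.

```python
import math
import random

def logistic_loss(w, x, y):
    """Logistic loss log(1 + exp(-y * w.x)) of a toy linear model, y in {-1, +1}."""
    z = y * sum(wi * xi for wi, xi in zip(w, x))
    return math.log1p(math.exp(-z))

def input_grad(w, x, y):
    """Gradient of the logistic loss with respect to the input x."""
    z = y * sum(wi * xi for wi, xi in zip(w, x))
    scale = -y / (1.0 + math.exp(z))
    return [scale * wi for wi in w]

def bayesian_attack_step(w, x_adv, x_clean, y, rng, n_samples=32,
                         sigma_w=0.05, sigma_x=0.05, step=0.01, eps=0.1):
    """One sign-gradient step on a gradient averaged over sampled (w, x) pairs."""
    avg = [0.0] * len(x_adv)
    for _ in range(n_samples):
        # Assumed Gaussian sampling around the current parameters and input.
        w_s = [wi + rng.gauss(0.0, sigma_w) for wi in w]
        x_s = [xi + rng.gauss(0.0, sigma_x) for xi in x_adv]
        g = input_grad(w_s, x_s, y)
        avg = [a + gi / n_samples for a, gi in zip(avg, g)]
    # Ascend the averaged loss, then project into the L-infinity ball around x_clean.
    stepped = [xi + step * math.copysign(1.0, a) for xi, a in zip(x_adv, avg)]
    return [min(max(s, c - eps), c + eps) for s, c in zip(stepped, x_clean)]

rng = random.Random(0)
w = [1.0, -2.0, 0.5]          # hypothetical substitute-model weights
x_clean = [0.3, -0.2, 0.1]    # hypothetical clean input
y = 1
x_adv = list(x_clean)
for _ in range(10):
    x_adv = bayesian_attack_step(w, x_adv, x_clean, y, rng)
```

Averaging the gradient over sampled parameters and inputs plays the role of attacking many nearby models on many nearby inputs at once, which is the intuition behind the ensemble interpretation in the abstract.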

Problem

Research questions and friction points this paper is trying to address.

Enhancing adversarial example transferability across unknown deep neural networks

Jointly diversifying model parameters and inputs using a Bayesian formulation

Improving transfer-based attack success rates without model fine-tuning

Innovation

Methods, ideas, or system contributions that make the work stand out.

A Bayesian formulation diversifies both the model parameters and the model input

Advanced posterior approximations enhance adversarial transferability

Principled fine-tuning within Bayesian framework improves attack success
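The notion of flat minima in both the parameter space and the input space can be made concrete with a simple probe: the average loss increase when parameters and input are perturbed together. The sketch below is only an illustration under assumptions, not the objective derived in the paper. The helper names `joint_sharpness` and `make_loss`, the Gaussian perturbations, and the toy quadratic loss are all hypothetical.

```python
import random

def joint_sharpness(loss_fn, w, x, rng, sigma_w=0.1, sigma_x=0.1, n_samples=256):
    """Average loss increase under joint Gaussian perturbations of w and x."""
    base = loss_fn(w, x)
    total = 0.0
    for _ in range(n_samples):
        w_s = [wi + rng.gauss(0.0, sigma_w) for wi in w]
        x_s = [xi + rng.gauss(0.0, sigma_x) for xi in x]
        total += loss_fn(w_s, x_s) - base
    return total / n_samples

def make_loss(curvature):
    """Toy stand-in loss: curvature * (w . x)^2, minimized where w . x = 0."""
    def loss_fn(w, x):
        dot = sum(wi * xi for wi, xi in zip(w, x))
        return curvature * dot * dot
    return loss_fn

# Both losses share the same minimum; only the curvature around it differs.
w, x = [1.0, 0.0], [0.0, 1.0]
flat = joint_sharpness(make_loss(1.0), w, x, random.Random(0))
sharp = joint_sharpness(make_loss(10.0), w, x, random.Random(0))
```

A lower value means the loss changes less under joint perturbations, which is the sense of "flat" used here. With identical random seeds the two estimates differ only through the curvature, so `sharp` comes out larger than `flat`.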
