Verification of Bit-Flip Attacks against Quantized Neural Networks

📅 2025-02-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Formal verification of quantized neural networks (QNNs) against bit-flip attacks remains an open challenge due to the intricate interplay between quantization effects and low-level bit perturbations. Method: We propose BFAVerifier, the first sound, complete, and practically efficient formal verification framework for this task. It introduces a novel dual-path paradigm: abstract interpretation to model quantization behavior and bit-level perturbations, coupled with mixed-integer linear programming (MILP) encoding to precisely capture bit-flip constraints—enabling symbolic reachability analysis. Contribution/Results: BFAVerifier is the first to support either exact vulnerability localization or rigorous robustness certification for *any* QNN parameter. It scales efficiently across diverse architectures, quantization bit-widths (4–8 bits), and attack intensities, achieving 10×–100× speedup over brute-force search. Crucially, it guarantees zero false positives and zero false negatives, overcoming the limitations of empirical evaluation methods.

Technology Category

Application Category

📝 Abstract
In the rapidly evolving landscape of neural network security, the resilience of neural networks against bit-flip attacks (i.e., an attacker maliciously flips an extremely small amount of bits within its parameter storage memory system to induce harmful behavior), has emerged as a relevant area of research. Existing studies suggest that quantization may serve as a viable defense against such attacks. Recognizing the documented susceptibility of real-valued neural networks to such attacks and the comparative robustness of quantized neural networks (QNNs), in this work, we introduce BFAVerifier, the first verification framework designed to formally verify the absence of bit-flip attacks or to identify all vulnerable parameters in a sound and rigorous manner. BFAVerifier comprises two integral components: an abstraction-based method and an MILP-based method. Specifically, we first conduct a reachability analysis with respect to symbolic parameters that represent the potential bit-flip attacks, based on a novel abstract domain with a sound guarantee. If the reachability analysis fails to prove the resilience of such attacks, then we encode this verification problem into an equivalent MILP problem which can be solved by off-the-shelf solvers. Therefore, BFAVerifier is sound, complete, and reasonably efficient. We conduct extensive experiments, which demonstrate its effectiveness and efficiency across various network architectures, quantization bit-widths, and adversary capabilities.
Problem

Research questions and friction points this paper is trying to address.

Verify resilience of quantized neural networks against bit-flip attacks.
Introduce BFAVerifier for formal verification of bit-flip attack vulnerabilities.
Evaluate BFAVerifier's effectiveness across diverse network architectures.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Verification framework for bit-flip attacks
Abstraction-based reachability analysis
MILP-based problem encoding
🔎 Similar Papers
No similar papers found.
Yedi Zhang
Yedi Zhang
National University of Singapore
Formal MethodsTrustworthy AI
L
Lei Huang
ShanghaiTech University, China
P
Pengfei Gao
ByteDance Inc, China
F
Fu Song
Key Laboratory of System Software (Chinese Academy of Sciences) and State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China
J
Jun Sun
Singapore Management University, Singapore
Jin Song Dong
Jin Song Dong
Professor of Computer Science, National University of Singapore
Formal MethodsTrusted AISafe AIModel CheckingSports Analytics