Verified Relative Safety Margins for Neural Network Twins

📅 2024-09-25

🏛️ arXiv.org

📈 Citations: 2

✨ Influential: 0

career value

184K/year

🤖 AI Summary

This work addresses the problem of comparing decision robustness between homogenous neural network classifiers—e.g., a pruned, quantized, or distilled model and its original counterpart. We formally introduce the *Relative Safety Margin* (RSM), a quantitative measure capturing the confidence-level disparity between their decision boundaries on identical inputs, thereby enabling verifiable local implication (N₂ ⇒ N₁). Methodologically, we integrate convex relaxation, interval propagation, and class-boundary sensitivity analysis to derive rigorous upper and lower bounds on RSM over input perturbation sets. Unlike conventional equivalence verification, RSM enables fine-grained, quantitative, and formally provable robustness comparison. Experiments on MNIST, CIFAR-10, and two medical imaging datasets demonstrate RSM’s effectiveness across pruning, quantization, and knowledge distillation. The approach significantly improves both accuracy and interpretability in assessing decision consistency and quality of compressed models.

Technology Category

Application Category

📝 Abstract

Given two Deep Neural Network (DNN) classifiers with the same input and output domains, our goal is to quantify the robustness of the two networks in relation to each other. Towards this, we introduce the notion of Relative Safety Margins (RSMs). Intuitively, given two classes and a common input, RSM of one classifier with respect to another reflects the relative margins with which decisions are made. The proposed notion is relevant in the context of several applications domains, including to compare a trained network and its corresponding compact network (e.g., pruned, quantized, distilled network). Not only can RSMs establish whether decisions are preserved, but they can also quantify their qualities. We also propose a framework to establish safe bounds on RSM gains or losses given an input and a family of perturbations. We evaluate our approach using the MNIST, CIFAR10, and two real-world medical datasets, to show the relevance of our results.

Problem

Research questions and friction points this paper is trying to address.

Comparing two neural networks over entire input regions

Establishing formal local implication between network classifiers

Verifying consistent correct decisions across input domains

Innovation

Methods, ideas, or system contributions that make the work stand out.

Formal local implication for neural networks

Sound formulation for verified implications

Comparing original and compact network versions

🔎 Similar Papers

No similar papers found.