Analog Bayesian neural networks are insensitive to the shape of the weight distribution

📅 2025-01-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of complex device-level stochastic error modeling when implementing Bayesian neural networks (BNNs) in analog hardware. Specifically, it investigates how the shape of the weight distribution—under mean-field variational inference (MFVI)—affects predictive performance. Through extensive MFVI training and uncertainty quantification experiments under realistic analog device noise (Gaussian, uniform, and impulse noise), we empirically demonstrate for the first time that the BNN’s predictive distribution depends solely on the first two moments (mean and variance) of the weight posterior, irrespective of the specific parametric form of the variational approximation. This finding substantially relaxes the requirement for precise characterization of noise distribution morphology in analog hardware design. Experimental results show that prediction errors across diverse noise distributions differ by less than 0.5%, significantly enhancing the feasibility and robustness of analog BNN deployment.

Technology Category

Application Category

📝 Abstract
Recent work has demonstrated that Bayesian neural networks (BNN's) trained with mean field variational inference (MFVI) can be implemented in analog hardware, promising orders of magnitude energy savings compared to the standard digital implementations. However, while Gaussians are typically used as the variational distribution in MFVI, it is difficult to precisely control the shape of the noise distributions produced by sampling analog devices. This paper introduces a method for MFVI training using real device noise as the variational distribution. Furthermore, we demonstrate empirically that the predictive distributions from BNN's with the same weight means and variances converge to the same distribution, regardless of the shape of the variational distribution. This result suggests that analog device designers do not need to consider the shape of the device noise distribution when hardware-implementing BNNs performing MFVI.
Problem

Research questions and friction points this paper is trying to address.

Bayesian Neural Networks
Random Errors
Prediction Accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian Neural Networks
Inherent Randomness
Low-power Analog Devices
🔎 Similar Papers
No similar papers found.
R
Ravi G. Patel
Sandia National Laboratories
T
T. Xiao
Sandia National Laboratories
Sapan Agarwal
Sapan Agarwal
Sandia National Laboratories
Semiconductors
C
Christopher H. Bennett
Sandia National Laboratories