🤖 AI Summary
This paper investigates the robustness of variational inference (VI) under model misspecification: specifically, whether KL-minimizing VI accurately recovers the mean and correlation matrix of a true posterior density $p$ when $p$ is even-symmetric or ellipsoidally symmetric and the variational family $Q$ is a location-scale family sharing those symmetries, even though $Q$ need not contain $p$. The authors establish rigorous guarantees: under even symmetry, KL-minimizing VI exactly recovers the mean of $p$; under ellipsoidal symmetry, it also exactly recovers the correlation matrix. These results hold under common misspecifications, including factorized approximations and heavy- or light-tailed mismatches between $q$ and $p$. The analysis exploits the symmetry structure of the KL objective over location-scale families. Experiments indicate that estimation error degrades gracefully as the symmetry assumptions weaken. Together, these results provide theoretical foundations and design guidance for Bayesian approximate inference. A schematic restatement of the objective and the two guarantees is given below.
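The following is a schematic restatement of the KL objective and the two guarantees summarized above, in notation introduced here only for illustration ($q^{\star}$, $\mu$, $\mathrm{Corr}$ are our labels); the paper states the precise regularity conditions.

```latex
% Objective: KL-minimizing VI over a variational family Q (illustrative notation).
q^{\star} \;=\; \operatorname*{arg\,min}_{q \in Q} \, \mathrm{KL}(q \,\|\, p)
          \;=\; \operatorname*{arg\,min}_{q \in Q} \, \mathbb{E}_{q}\big[\log q(x) - \log p(x)\big]

% (i)  Even symmetry: if p(\mu + x) = p(\mu - x) for all x, and Q is a
%      location-scale family sharing this symmetry, then
%      \mathbb{E}_{q^{\star}}[x] \;=\; \mathbb{E}_{p}[x] \;=\; \mu .

% (ii) Elliptical symmetry: if in addition p is ellipsoidally symmetric, then
%      \mathrm{Corr}(q^{\star}) \;=\; \mathrm{Corr}(p) .
```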
📝 Abstract
Given an intractable target density $p$, variational inference (VI) attempts to find the best approximation $q$ from a tractable family $Q$. This is typically done by minimizing the exclusive Kullback-Leibler divergence, $\mathrm{KL}(q\,\|\,p)$. In practice, $Q$ is not rich enough to contain $p$, and the approximation is misspecified even when it is a unique global minimizer of $\mathrm{KL}(q\,\|\,p)$. In this paper, we analyze the robustness of VI to these misspecifications when $p$ exhibits certain symmetries and $Q$ is a location-scale family that shares these symmetries. We prove strong guarantees for VI not only under mild regularity conditions but also in the face of severe misspecifications. Namely, we show that (i) VI recovers the mean of $p$ when $p$ exhibits an *even* symmetry, and (ii) it recovers the correlation matrix of $p$ when in addition $p$ exhibits an *elliptical* symmetry. These guarantees hold for the mean even when $q$ is factorized and $p$ is not, and for the correlation matrix even when $q$ and $p$ behave differently in their tails. We analyze various regimes of Bayesian inference where these symmetries are useful idealizations, and we also investigate experimentally how VI behaves in their absence.
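As a rough numerical illustration of guarantee (i), not taken from the paper, the sketch below fits a factorized (mean-field) Gaussian to a correlated, hence non-factorized, even-symmetric target by minimizing a Monte Carlo estimate of $\mathrm{KL}(q\,\|\,p)$, and checks that the fitted mean matches the target mean. All names, targets, and optimizer settings here are our own assumptions.

```python
# A minimal sketch (not the authors' code): KL-minimizing VI with a factorized
# Gaussian q recovers the mean of an even-symmetric, non-factorized target p.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)

# Target p: a correlated 2-D Gaussian, even-symmetric about mu_p but not factorized.
mu_p = np.array([1.0, -2.0])
Sigma_p = np.array([[1.0, 0.8],
                    [0.8, 1.0]])
Sigma_p_inv = np.linalg.inv(Sigma_p)

def log_p(x):
    # Unnormalized log-density of p; the normalizer cancels in argmin KL(q||p).
    d = x - mu_p
    return -0.5 * np.einsum("ij,jk,ik->i", d, Sigma_p_inv, d)

# Variational family Q: factorized Gaussians q = N(m, diag(s^2)).
eps = rng.standard_normal((5000, 2))
eps -= eps.mean(axis=0)  # center the fixed samples to reduce Monte Carlo bias in this demo

def kl_estimate(params):
    m, log_s = params[:2], params[2:]
    s = np.exp(log_s)
    x = m + s * eps  # reparameterized samples from q
    log_q = -0.5 * np.sum(((x - m) / s) ** 2 + 2 * log_s + np.log(2 * np.pi), axis=1)
    # Monte Carlo estimate of KL(q||p), up to an additive constant.
    return np.mean(log_q - log_p(x))

res = minimize(kl_estimate, x0=np.zeros(4), method="Nelder-Mead",
               options={"maxiter": 5000, "xatol": 1e-8, "fatol": 1e-10})
m_hat = res.x[:2]
print("target mean:", mu_p)
print("fitted mean:", m_hat)  # should be close to mu_p despite the factorized q
```

The fitted variances will be too small (a well-known feature of the exclusive KL), but the fitted mean should track the target mean, which is the point of guarantee (i).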