π€ AI Summary
This work addresses the high computational cost of Bayesian inference with projection priors, which typically require nested iterative optimization. The authors propose a continuously relaxed projection prior that eliminates the need for inner-loop optimization during posterior updates by introducing a duality gap and placing a probabilistic prior on it to drive its contraction toward zero. This approach is the first to incorporate a continuous shrinkage mechanism into projection priors, yielding a differentiable, inner-loop-free approximate prior. The method also establishes a theoretical connection to globalβlocal shrinkage priors. Empirical results demonstrate competitive posterior shrinkage performance, and the approach is successfully applied to marketing data analysis of multivariate shopping decisions, effectively identifying key predictive factors.
π Abstract
Projected priors were originally introduced to accommodate parameter constraints, but have recently regained popularity due to their ability to assign probability mass to low-dimensional parameter sets, such as the spaces of sparse vectors, directed acyclic graphs, or transport plans. When employed as a transformation of random variables, projection is especially useful, since its contraction property not only preserves probability concentration, but also often preserves differentiability for gradient-based posterior computation. On the other hand, unless the projection can be obtained by some non-iterative algorithm, posterior computation can be expensive because it requires nesting an iterative optimization routine within each Markov chain Monte Carlo iteration. In this article, inspired by the success of continuous shrinkage models as replacements for discrete spike-and-slab priors, we propose a continuous relaxation of projected priors. The key idea is to quantify the duality gap between the primal projection loss and the dual objective, and impose a probabilistic prior that shrinks this gap toward zero. The resulting gap-shrinkage prior has a tractable form, does not require running an optimization subroutine inside each posterior update, and puts probability mass near the exact projection. We demonstrate useful properties of gap-shrinkage priors, including connections to global-local shrinkage priors, broad applicability to generalized projection functions, and competitive performance in posterior contraction. We apply the gap-shrinkage model to a marketing data analysis aimed at identifying important predictor effects on multivariate grocery-shopping decisions.