🤖 AI Summary
Existing generative models for density estimation suffer from limited interpretability and difficulty in quantifying uncertainty, while traditional Bayesian nonparametric methods—such as Markov chain Monte Carlo (MCMC)—exhibit high computational cost and poor scalability to deep learning frameworks. To address these challenges, we propose the Variational Polya Tree (VPT) model: the first framework that tightly integrates the Polya tree Bayesian nonparametric prior with stochastic variational inference, yielding a differentiable, end-to-end trainable density estimator. By optimizing the joint distribution likelihood, VPT achieves more accurate posterior approximation and overcomes the efficiency bottlenecks inherent in MCMC. Empirically, VPT scales effectively to high-dimensional data and attains competitive performance with state-of-the-art deep density estimators on both real-world tabular and image data. Crucially, it significantly enhances model interpretability and enables principled uncertainty quantification. The implementation is publicly available.
📝 Abstract
Density estimation is essential for generative modeling, particularly with the rise of modern neural networks. While existing methods capture complex data distributions, they often lack interpretability and uncertainty quantification. Bayesian nonparametric methods, especially the polya tree, offer a robust framework that addresses these issues by accurately capturing function behavior over small intervals. Traditional techniques like Markov chain Monte Carlo (MCMC) face high computational complexity and scalability limitations, hindering the use of Bayesian nonparametric methods in deep learning. To tackle this, we introduce the variational polya tree (VPT) model, which employs stochastic variational inference to compute posterior distributions. This model provides a flexible, nonparametric Bayesian prior that captures latent densities and works well with stochastic gradient optimization. We also leverage the joint distribution likelihood for a more precise variational posterior approximation than traditional mean-field methods. We evaluate the model performance on both real data and images, and demonstrate its competitiveness with other state-of-the-art deep density estimation methods. We also explore its ability in enhancing interpretability and uncertainty quantification. Code is available at https://github.com/howardchanth/var-polya-tree.