Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

📅 2026-02-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitation of traditional Latent Dirichlet Allocation (LDA) in capturing correlations and hierarchical structures among topics due to its use of a symmetric Dirichlet prior. To overcome this, we propose Latent Dirichlet-Tree Allocation (LDTA), which introduces a Dirichlet-Tree prior into the topic modeling framework for the first time. LDTA preserves LDA’s generative structure while explicitly modeling tree-structured dependencies among topic proportions. We derive corresponding mean-field variational inference and expectation propagation algorithms, uncover their vectorized nature, and implement fully vectorized, GPU-accelerated computation. This approach substantially enhances representational capacity over LDA while maintaining scalability, enabling efficient and scalable Bayesian inference.

Technology Category

Application Category

📝 Abstract
Latent Dirichlet Allocation (LDA) is a foundational model for discovering latent thematic structure in discrete data, but its Dirichlet prior cannot represent the rich correlations and hierarchical relationships often present among topics. We introduce the framework of Latent Dirichlet-Tree Allocation (LDTA), a generalization of LDA that replaces the Dirichlet prior with an arbitrary Dirichlet-Tree (DT) distribution. LDTA preserves LDA's generative structure but enables expressive, tree-structured priors over topic proportions. To perform inference, we develop universal mean-field variational inference and Expectation Propagation, providing tractable updates for all DT. We reveal the vectorized nature of the two inference methods through theoretical development, and perform fully vectorized, GPU-accelerated implementations. The resulting framework substantially expands the modeling capacity of LDA while maintaining scalability and computational efficiency.
Problem

Research questions and friction points this paper is trying to address.

Latent Dirichlet Allocation
Dirichlet-Tree
topic correlations
hierarchical relationships
Bayesian inference
Innovation

Methods, ideas, or system contributions that make the work stand out.

Latent Dirichlet-Tree Allocation
Dirichlet-Tree prior
vectorized inference
GPU acceleration
hierarchical topic modeling
🔎 Similar Papers
No similar papers found.
Z
Zheng Wang
Concordia Institute for Information Systems Engineering, Concordia University, Montreal, QC H3G 1M8, Canada
Nizar Bouguila
Nizar Bouguila
Professor
pattern recognitioncomputer visiondata miningimage processingmachine learning