🤖 AI Summary
Traditional VAEs and Poisson-VAEs fail to capture neural spike count overdispersion—i.e., variance substantially exceeding the mean. To address this, we propose NegBio-VAE, the first VAE that incorporates the negative binomial distribution in its decoder to explicitly model a learnable dispersion parameter, thereby relaxing the rigid mean-equals-variance constraint inherent to the Poisson distribution. We develop a differentiable reparameterization scheme for the negative binomial distribution and introduce two ELBO optimization strategies to enable end-to-end variational inference. Experiments on multi-source neural electrophysiological datasets demonstrate that introducing only a single learnable dispersion parameter significantly improves spike count reconstruction accuracy. NegBio-VAE consistently outperforms Poisson-VAE and other baseline models on overdispersed neural data modeling tasks, establishing a new state-of-the-art for probabilistic spike train representation learning.
📝 Abstract
Biological neurons communicate through spike trains, discrete, irregular bursts of activity that exhibit variability far beyond the modeling capacity of conventional variational autoencoders (VAEs). Recent work, such as the Poisson-VAE, makes a biologically inspired move by modeling spike counts using the Poisson distribution. However, they impose a rigid constraint: equal mean and variance, which fails to reflect the true stochastic nature of neural activity. In this work, we challenge this constraint and introduce NegBio-VAE, a principled extension of the VAE framework that models spike counts using the negative binomial distribution. This shift grants explicit control over dispersion, unlocking a broader and more accurate family of neural representations. We further develop two ELBO optimization schemes and two differentiable reparameterization strategies tailored to the negative binomial setting. By introducing one additional dispersion parameter, NegBio-VAE generalizes the Poisson latent model to a negative binomial formulation. Empirical results demonstrate this minor yet impactful change leads to significant gains in reconstruction fidelity, highlighting the importance of explicitly modeling overdispersion in spike-like activations.