A New Family of Poisson Non-negative Matrix Factorization Methods Using the Shifted Log Link

πŸ“… 2026-01-09
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Traditional Poisson non-negative matrix factorization (Poisson NMF) assumes that data are generated additively from β€œparts,” which constrains the model’s flexibility and interpretability. This work proposes a novel approach based on a shifted logarithmic link function that introduces a single tuning parameter to enable a continuous transition between additive and multiplicative combinations. We are the first to incorporate this link function into the Poisson NMF framework and develop an efficient maximum likelihood estimation algorithm whose computational complexity scales linearly with the number of non-zero entries in the data, making it suitable for large-scale sparse datasets. Experimental results demonstrate that the choice of link function substantially influences decomposition outcomes, and the proposed method enhances both representational capacity and interpretability across multiple real-world datasets.

Technology Category

Application Category

πŸ“ Abstract
Poisson non-negative matrix factorization (NMF) is a widely used method to find interpretable"parts-based"decompositions of count data. While many variants of Poisson NMF exist, existing methods assume that the"parts"in the decomposition combine additively. This assumption may be natural in some settings, but not in others. Here we introduce Poisson NMF with the shifted-log link function to relax this assumption. The shifted-log link function has a single tuning parameter, and as this parameter varies the model changes from assuming that parts combine additively (i.e., standard Poisson NMF) to assuming that parts combine more multiplicatively. We provide an algorithm to fit this model by maximum likelihood, and also an approximation that substantially reduces computation time for large, sparse datasets (computations scale with the number of non-zero entries in the data matrix). We illustrate these new methods on a variety of real datasets. Our examples show how the choice of link function in Poisson NMF can substantively impact the results, and how in some settings the use of a shifted-log link function may improve interpretability compared with the standard, additive link.
Problem

Research questions and friction points this paper is trying to address.

Poisson non-negative matrix factorization
additive combination
link function
interpretability
count data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Poisson non-negative matrix factorization
shifted log link
additive-to-multiplicative modeling
scalable inference
count data decomposition
πŸ”Ž Similar Papers
No similar papers found.
E
Eric Weine
Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology
Peter Carbonetto
Peter Carbonetto
University of Chicago
Quantitative genetics
R
Rafael A. Irizarry
Department of Data Science, Dana-Farber Cancer Institute
Matthew Stephens
Matthew Stephens
University of Chicago
StatisticsGenetics