A Dependent Feature Allocation Model Based on Random Fields

📅 2025-12-19

📈 Citations: 0

✨ Influential: 0

career value

203K/year

🤖 AI Summary

Traditional nonparametric feature allocation models struggle to explicitly capture dependency structures, covariate effects, and temporal-spatial dynamics. To address this, we propose a Bayesian nonparametric framework grounded in Gaussian Markov random fields (GMRFs), the first to embed GMRFs into the logit layer of the feature paintbox, thereby jointly modeling local dependencies among latent variables. Our approach innovatively integrates a data-adaptive k-nearest-neighbor graph structure with an Ornstein–Uhlenbeck process, enabling principled modeling of spatiotemporal evolution and low-rank feature correlations. The resulting method achieves both high-dimensional scalability and interpretability. Evaluated on multi-drug combination clinical data, it successfully infers patients’ latent health states, significantly improving accuracy in dependency-aware feature allocation and fidelity in covariate-response modeling.

Technology Category

Application Category

📝 Abstract

We introduce a flexible framework for modeling dependent feature allocations. Our approach addresses limitations in traditional nonparametric methods by directly modeling the logit-probability surface of the feature paintbox, enabling the explicit incorporation of covariates and complex but tractable dependence structures. The core of our model is a Gaussian Markov Random Field (GMRF), which we use to robustly decompose the latent field, separating a structural component based on the baseline covariates from intrinsic, unstructured heterogeneity. This structure is not a rigid grid but a sparse k-nearest neighbors graph derived from the latent geometry in the data, ensuring high-dimensional tractability. We extend this framework to a dynamic spatio-temporal process, allowing item effects to evolve via an Ornstein-Uhlenbeck process. Feature correlations are captured using a low-rank factorization of their joint prior. We demonstrate our model's utility by applying it to a polypharmacy dataset, successfully inferring latent health conditions from patient drug profiles.

Problem

Research questions and friction points this paper is trying to address.

Models dependent feature allocations with covariates and complex dependencies

Uses Gaussian Markov Random Field to separate structural and intrinsic heterogeneity

Extends to dynamic spatio-temporal processes for evolving item effects

Innovation

Methods, ideas, or system contributions that make the work stand out.

Gaussian Markov Random Field for feature allocation

Sparse k-nearest neighbors graph for tractability

Dynamic spatio-temporal Ornstein-Uhlenbeck process evolution

🔎 Similar Papers

Spurious Correlations in Machine Learning: A Survey