DAG Learning from Zero-Inflated Count Data Using Continuous Optimization

📅 2025-12-18
📈 Citations: 0
Influential: 0
📄 PDF

career value

186K/year
🤖 AI Summary
This work addresses directed acyclic graph (DAG) structure learning from zero-inflated count data. We propose ZICO, the first framework unifying zero-inflated generalized linear models (ZIGLMs) with continuous differentiable DAG optimization. ZICO jointly optimizes a smooth score-based objective via a differentiable acyclicity constraint (e.g., TRACE), L1 sparsity regularization, and fully vectorized mini-batch training. Crucially, we design a novel differentiable zero-inflated likelihood function, substantially improving computational efficiency and numerical stability for high-dimensional settings. Experiments demonstrate that ZICO achieves higher structural accuracy and faster convergence on synthetic benchmarks; in gene regulatory network inference, it matches or surpasses state-of-the-art methods; and it scales to thousands of variables, confirming practical applicability.

Technology Category

Application Category

📝 Abstract
We address network structure learning from zero-inflated count data by casting each node as a zero-inflated generalized linear model and optimizing a smooth, score-based objective under a directed acyclic graph constraint. Our Zero-Inflated Continuous Optimization (ZICO) approach uses node-wise likelihoods with canonical links and enforces acyclicity through a differentiable surrogate constraint combined with sparsity regularization. ZICO achieves superior performance with faster runtimes on simulated data. It also performs comparably to or better than common algorithms for reverse engineering gene regulatory networks. ZICO is fully vectorized and mini-batched, enabling learning on larger variable sets with practical runtimes in a wide range of domains.
Problem

Research questions and friction points this paper is trying to address.

Learning network structure from zero-inflated count data
Optimizing a smooth objective under DAG constraints
Enhancing performance for gene regulatory network inference
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses zero-inflated generalized linear models for nodes
Applies differentiable acyclicity constraint with sparsity regularization
Implements fully vectorized mini-batched optimization for scalability