Learning biophysical models of gene regulation with probability flow matching

📅 2026-04-27
📈 Citations: 0
Influential: 0
📄 PDF

career value

213K/year
🤖 AI Summary
Current methods struggle to infer gene regulatory dynamics from single-cell RNA sequencing data that are both mechanistically interpretable and generalizable across conditions. This work proposes a Probabilistic Flow Matching (PFM) framework that, for the first time, integrates biophysical consistency constraints into stochastic process modeling to directly learn regulatory-mechanism-compliant dynamical systems from time-resolved single-cell data. PFM accurately captures cell fate decisions, lineage transitions, and perturbation responses while jointly inferring proliferation and death dynamics. Experiments on three hematopoietic datasets demonstrate that PFM not only achieves high-precision interpolation but also correctly recapitulates lineage transitions, fate specification, and genetic perturbation effects, significantly outperforming non-mechanistic baseline approaches.
📝 Abstract
Cellular differentiation is governed by gene regulatory networks, the high-dimensional stochastic biochemical systems that determine the transcriptional landscape and mediate cellular responses to signals and perturbations. Although single-cell RNA sequencing provides quantitative snapshots of the transcriptome, current methods for inferring gene-regulatory dynamics often lack mechanistic interpretability and fail to generalize to unseen conditions. Here we introduce Probability Flow Matching (PFM), a scalable framework for learning biophysically consistent stochastic processes directly from time-resolved single-cell measurements. Applying PFM to three hematopoiesis datasets, we show that models with similar interpolation accuracy can encode fundamentally different dynamics, with only biophysically consistent formulations accurately capturing mechanisms of lineage transitions, fate specification, and gene perturbation responses. We further demonstrate that PFM accommodates unbalanced populations, enabling simultaneous inference of cellular proliferation and death dynamics. Together, these results establish PFM as a flexible, scalable framework for integrating mechanistic modeling with single-cell omics.
Problem

Research questions and friction points this paper is trying to address.

gene regulatory networks
single-cell RNA sequencing
mechanistic interpretability
generalization
cellular differentiation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Probability Flow Matching
gene regulatory networks
single-cell RNA sequencing
biophysical modeling
cellular dynamics
💼 Related Jobs
Postdoctoral Fellow – AI/ML Enabled Bioprocess Modeling and Control
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Andover
S
Suryanarayana Maddu
Center for Computational Biology, Flatiron Institute, New York, NY, USA, 10010; Molecular & Cellular Biology Department, Harvard University, Cambridge, MA 02138
Victor Chardès
Victor Chardès
Center for Computational Biology, Flatiron Institute
BiophysicsBiologyStatistical PhysicsStochastic Processes
M
Michael J. Shelley
Center for Computational Biology, Flatiron Institute, New York, NY, USA, 10010; Courant Institute of Mathematical Sciences, New York University, New York, NY, USA, 10012