From Snapshots to Trajectories: Learning Single-Cell Gene Expression Dynamics via Conditional Flow Matching

📅 2026-05-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

197K/year
🤖 AI Summary
Single-cell RNA sequencing data are typically collected as unpaired, discrete temporal snapshots, posing significant challenges for accurately modeling the continuous dynamic evolution of cellular states. To address this, this work proposes a single-cell Flow Matching (scFM) framework that innovatively integrates entropy-regularized optimal transport with conditional flow matching. By constructing a soft-supervised objective, scFM learns time-dependent velocity fields and incorporates both bidirectional velocity consistency constraints and distribution-level alignment mechanisms. This approach effectively mitigates error accumulation and distribution drift in long-horizon predictions, substantially improving the accuracy of interpolation and extrapolation on real temporal datasets. Consequently, scFM enables more precise and temporally coherent reconstruction and visualization of cellular trajectories.
📝 Abstract
Single-cell RNA sequencing (scRNA-seq) provides high-dimensional profiles of cellular states, enabling data-driven modeling of cellular dynamics over time. In practice, time-resolved scRNA-seq is collected at only a few discrete time points as unpaired snapshot populations, leaving substantial temporal gaps. This motivates trajectory inference at unmeasured time points. Existing methods mainly follow two directions, optimal-transport (OT) alignment provides distribution-level matching between observed snapshots, while continuous-time generative models support forecasting via learned dynamics. However, two challenges remain: (i) unpaired snapshots render local transitions between adjacent time points ambiguous, leading to unstable supervision; and (ii) long-horizon prediction relies on repeated integration, where small modeling errors compound and cause distribution drift. To address these challenges, we propose single-cell Flow Matching (scFM), a latent generative framework based on coupling-conditioned flow matching. First, we compute entropically regularized OT couplings between adjacent snapshots and use them to construct soft, weighted flow-matching targets for learning time-dependent velocity fields. Second, we learn bidirectional velocity fields and leverage their consistency to refine couplings and improve temporal coherence under sparse supervision. Third, we introduce distribution-level alignment and latent dynamic regularization to anchor long rollouts and mitigate drift. Experiments on real-world time-series scRNA-seq datasets show that scFM consistently improves distributional prediction performance for both temporal interpolation and extrapolation. Moreover, scFM yields more accurate trajectory reconstruction and temporally coherent visualizations where intermediate time points are absent, indicating a more faithful recovery of underlying temporal gene expression dynamics.
Problem

Research questions and friction points this paper is trying to address.

single-cell RNA sequencing
trajectory inference
unpaired snapshots
temporal dynamics
distribution drift
Innovation

Methods, ideas, or system contributions that make the work stand out.

conditional flow matching
optimal transport
single-cell RNA-seq
trajectory inference
latent generative model
🔎 Similar Papers
No similar papers found.
S
Siyu Pu
Computer Network Information Center, Chinese Academy of Sciences
Q
Qingqing Long
Computer Network Information Center, Chinese Academy of Sciences
X
Xiaohan Huang
Computer Network Information Center, Chinese Academy of Sciences
Haotian Chen
Haotian Chen
University of California, Los Angeles
Political EconomyNon-market StrategyAmerican Politics
J
Jiajia Wang
Computer Network Information Center, Chinese Academy of Sciences
M
Meng Xiao
Computer Network Information Center, Chinese Academy of Sciences
X
Xiao Luo
University of California, Los Angeles
H
Hengshu Zhu
Computer Network Information Center, Chinese Academy of Sciences
Yuanchun Zhou
Yuanchun Zhou
Computer Network Information Center,CAS
Data MiningBig Data Analysis
Xuezhi Wang
Xuezhi Wang
Research Scientist, Google DeepMind
Machine LearningNatural Language Processing