🤖 AI Summary
Single-cell RNA sequencing data are typically collected as unpaired, discrete temporal snapshots, posing significant challenges for accurately modeling the continuous dynamic evolution of cellular states. To address this, this work proposes a single-cell Flow Matching (scFM) framework that innovatively integrates entropy-regularized optimal transport with conditional flow matching. By constructing a soft-supervised objective, scFM learns time-dependent velocity fields and incorporates both bidirectional velocity consistency constraints and distribution-level alignment mechanisms. This approach effectively mitigates error accumulation and distribution drift in long-horizon predictions, substantially improving the accuracy of interpolation and extrapolation on real temporal datasets. Consequently, scFM enables more precise and temporally coherent reconstruction and visualization of cellular trajectories.
📝 Abstract
Single-cell RNA sequencing (scRNA-seq) provides high-dimensional profiles of cellular states, enabling data-driven modeling of cellular dynamics over time. In practice, time-resolved scRNA-seq is collected at only a few discrete time points as unpaired snapshot populations, leaving substantial temporal gaps. This motivates trajectory inference at unmeasured time points. Existing methods mainly follow two directions, optimal-transport (OT) alignment provides distribution-level matching between observed snapshots, while continuous-time generative models support forecasting via learned dynamics. However, two challenges remain: (i) unpaired snapshots render local transitions between adjacent time points ambiguous, leading to unstable supervision; and (ii) long-horizon prediction relies on repeated integration, where small modeling errors compound and cause distribution drift. To address these challenges, we propose single-cell Flow Matching (scFM), a latent generative framework based on coupling-conditioned flow matching. First, we compute entropically regularized OT couplings between adjacent snapshots and use them to construct soft, weighted flow-matching targets for learning time-dependent velocity fields. Second, we learn bidirectional velocity fields and leverage their consistency to refine couplings and improve temporal coherence under sparse supervision. Third, we introduce distribution-level alignment and latent dynamic regularization to anchor long rollouts and mitigate drift. Experiments on real-world time-series scRNA-seq datasets show that scFM consistently improves distributional prediction performance for both temporal interpolation and extrapolation. Moreover, scFM yields more accurate trajectory reconstruction and temporally coherent visualizations where intermediate time points are absent, indicating a more faithful recovery of underlying temporal gene expression dynamics.