ScDiVa: Masked Discrete Diffusion for Joint Modeling of Single-Cell Identity and Expression

📅 2026-02-03
🤖 AI Summary
Single-cell RNA sequencing data are high-dimensional, sparse, and unordered, so conventional autoregressive generative models impose an artificial gene ordering and accumulate errors during generation. To address this, the work proposes scDiVa, a masked discrete diffusion foundation model whose continuous-time forward masking process mirrors the dropout-like missingness in single-cell data. Its bidirectional denoiser jointly models discrete gene identities and continuous expression values, and it uses entropy-normalized serialization and a latent anchor token to improve expression reconstruction while preserving global cell identity. Pretrained on 59 million cells, scDiVa achieves strong transfer performance on batch integration, cell type annotation, and perturbation response prediction.

📝 Abstract
Single-cell RNA-seq profiles are high-dimensional, sparse, and unordered, causing autoregressive generation to impose an artificial ordering bias and suffer from error accumulation. To address this, we propose scDiVa, a masked discrete diffusion foundation model that aligns generation with the dropout-like corruption process by defining a continuous-time forward masking mechanism in token space. scDiVa features a bidirectional denoiser that jointly models discrete gene identities and continuous values, utilizing entropy-normalized serialization and a latent anchor token to maximize information efficiency and preserve global cell identity. The model is trained via depth-invariant time sampling and a dual denoising objective to simulate varying sparsity levels while ensuring precise recovery of both identity and magnitude. Pre-trained on 59 million cells, scDiVa achieves strong transfer performance across major benchmarks, including batch integration, cell type annotation, and perturbation response prediction. These results suggest that masked discrete diffusion serves as a biologically coherent and effective alternative to autoregression.
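
The forward masking and dual denoising objective described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes the common masked-diffusion convention in which each token is masked independently with probability t, and it assumes the dual objective is a cross-entropy term on gene identities plus a squared-error term on expression values, both computed on masked positions only. The names `MASK_ID`, `forward_mask`, and `dual_denoising_loss` are hypothetical, and t is drawn uniformly here as a stand-in; the paper's depth-invariant time sampling is not reproduced.

```python
import numpy as np

MASK_ID = 0  # hypothetical id reserved for the [MASK] token (assumption)


def forward_mask(gene_ids, values, t, rng):
    """Mask each token independently with probability t, where 0 <= t <= 1."""
    is_masked = rng.random(gene_ids.shape) < t
    noisy_ids = np.where(is_masked, MASK_ID, gene_ids)
    noisy_values = np.where(is_masked, 0.0, values)  # hide the magnitude too
    return noisy_ids, noisy_values, is_masked


def dual_denoising_loss(id_logits, value_pred, gene_ids, values, is_masked):
    """Assumed dual objective: identity cross-entropy + value MSE on masked tokens."""
    if not is_masked.any():
        return 0.0
    # Numerically stable log-softmax over the gene vocabulary axis.
    shifted = id_logits - id_logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    # Negative log-likelihood of the true gene id at every position.
    nll = -np.take_along_axis(log_probs, gene_ids[..., None], axis=-1)[..., 0]
    sq_err = (value_pred - values) ** 2
    return float(nll[is_masked].mean() + sq_err[is_masked].mean())


# Toy cell with four expressed genes (ids and values are made up).
rng = np.random.default_rng(0)
gene_ids = np.array([5, 12, 7, 3])
values = np.array([1.5, 0.2, 3.1, 0.9])
t = rng.uniform()  # uniform stand-in for the paper's time sampling
noisy_ids, noisy_values, is_masked = forward_mask(gene_ids, values, t, rng)
```

Small t leaves most of the profile visible and large t hides most of it, which is how a single training procedure can cover a range of sparsity levels. A real denoiser would produce `id_logits` and `value_pred` from the corrupted inputs; the sketch only shows how the corruption and the loss fit together.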
Problem

Research questions and friction points this paper is trying to address.

single-cell RNA-seq
autoregressive generation
ordering bias
error accumulation
sparsity
Innovation

Methods, ideas, or system contributions that make the work stand out.

masked discrete diffusion
single-cell RNA-seq
bidirectional denoiser
entropy-normalized serialization
latent anchor token
Mingxuan Wang
The Chinese University of Hong Kong, Shenzhen
Speech Synthesis, Spoken Language Processing
Cheng Chen
East China Normal University
Online Learning, Optimization, Numerical Linear Algebra
Gaoyang Jiang
School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, China
Zijia Ren
School of Mathematics, Jilin University, Changchun, China
Chuangxin Zhao
Beijing Academy of Artificial Intelligence, Beijing, China
Lu Shi
Postdoc, Tsinghua University
Robotics, Control, Data-Driven, Koopman Operator
Yanbiao Ma
Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China