CellxPert: Inference-Time MCMC Steering of a Multi-Omics Single-Cell Foundation Model for In-Silico Perturbation

📅 2026-04-30
📈 Citations: 0
Influential: 0
📄 PDF

career value

219K/year
🤖 AI Summary
Existing single-cell foundation models struggle to accurately simulate gene perturbation effects and lack a unified framework for modeling multi-omics and spatial data. This work proposes a scalable, unified foundation model that integrates a multimodal Transformer architecture with LoRA fine-tuning and, for the first time, combines Metropolis-Hastings MCMC sampling with masked conditional distributions to enable continuous, interpretable simulation of transcriptional responses under perturbations—avoiding out-of-distribution artifacts caused by conventional token-based manipulations. The model supports 154 fine-grained cell-type annotations and incorporates spatial multi-omics data such as MERFISH and mass spectrometry imaging. It significantly outperforms current methods across multiple benchmarks—including PBMC68K and Replogle Perturb-seq—in tasks spanning cell annotation, perturbation response prediction, and multi-omics integration.
📝 Abstract
In this work, we introduce CellxPert, a scalable multimodal foundation model that unifies single-cell and spatial multi-omics within a common representation space. CellxPert jointly encodes transcriptomic (scRNA-seq), chromatin-accessibility (ATAC-seq), and surface-proteomic (CITE-seq) measurements, while directly incorporating MERFISH and imaging mass-cytometry data as 2D or 3D spatial-visual layers. CellxPert facilitates four key downstream tasks out of the box: (i) cell-type annotation across a broad ontology of 154 largely overlapping identities -- the largest label space addressed to date and a stringent test of fine-grained discrimination, (ii) efficient fine-tuning using Low Rank Adaptation (LoRA), (iii) genome-wide transcriptomic response prediction to in-silico perturbations (ISP), and (iv) seamless multi-omic integration across various assays and platforms. Unlike current single-cell foundation models, which approximate gene perturbations by deleting or reordering tokenized gene expression ranks, CellxPert employs a Metropolis-Hastings sampler whose proposal kernel uses the model's masked conditional distributions to transition to new transcriptomic states conditioned on the perturbed genes. This Markov-chain procedure mitigates out-of-distribution artifacts introduced by abrupt token manipulation and produces trajectories that are biologically interpretable. Evaluations on PBMC68K, Replogle Perturb-seq, Systema, and BMMC benchmarks show that CellxPert surpasses classical and state-of-the-art baselines in cell-type annotation, perturbation response prediction, and multi-omic integration.
Problem

Research questions and friction points this paper is trying to address.

in-silico perturbation
single-cell multi-omics
transcriptomic response prediction
foundation model
out-of-distribution artifacts
Innovation

Methods, ideas, or system contributions that make the work stand out.

in-silico perturbation
MCMC steering
multi-omics foundation model
Metropolis-Hastings sampling
spatial single-cell integration
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid