BaySC: Uncovering Tissue Architecture in Spatial Multi-Omics via Probabilistic Spatial Clustering

📅 2026-05-14
📈 Citations: 0
Influential: 0
📄 PDF

career value

215K/year
🤖 AI Summary
Current spatial omics clustering methods often suffer from over-smoothing that blurs biological boundaries, reliance on pre-specified cluster numbers, and a lack of effective multi-omics integration mechanisms. To address these limitations, this work proposes BaySC—a Bayesian inference–based spatial clustering framework that uniquely integrates a mixture of finite mixtures (MFM) model with a Markov random field (MRF) to automatically infer the number of spatial domains while preserving local spatial consistency. Furthermore, BaySC introduces an interpretable weighted log-likelihood fusion strategy to quantify the contribution of each omics modality to the resulting tissue atlas. Extensive experiments on ten single-modality and two multi-modality datasets demonstrate that BaySC significantly outperforms existing methods in both clustering accuracy and preservation of spatial topology, as measured by the spARI metric.
📝 Abstract
Spatial domain identification requires jointly modeling molecular signatures and physical coordinates, yet current tools frequently over-smooth biological boundaries, require user-specified cluster numbers, and lack principled multimodal integration. We introduce BaySC, an integrative Bayesian spatial clustering framework for spatial domain identification. BaySC inherently learns the true number of spatial domains from the data by employing a Mixture of Finite Mixtures (MFM) prior. Tissue topology is modeled via a Markov Random Field (MRF) applied to discrete cellular assignments, a strategy that enforces local spatial coherence without distorting the underlying gene expression features. This enables BaySC to accurately map contiguous tissue layers as well as geographically scattered, transcriptionally identical cell populations. Furthermore, BaySC handles spatial multi-omics data through a weighted log-likelihood fusion mechanism executed via Gibbs sampling. This approach assigns interpretable weights to each modality, allowing users to quantify the biological relevance of different data layers to the final tissue map. Validated across ten single-modal spatial transcriptomics and two spatial multi-omics datasets, BaySC yields highly interpretable probabilistic outputs. It demonstrates competitive accuracy on standard clustering metrics and consistently outperforms existing tools in preserving spatial topography, as measured by spatially-aware Adjusted Rand Index (spARI).
Problem

Research questions and friction points this paper is trying to address.

spatial domain identification
spatial multi-omics
biological boundaries
multimodal integration
spatial clustering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian spatial clustering
Mixture of Finite Mixtures
Markov Random Field
spatial multi-omics integration
adaptive cluster number
🔎 Similar Papers
No similar papers found.
X
Xin Li
School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China
X
Xiaofei Dong
School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China
Z
Zhenke Duan
School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China
L
Lulu Shang
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Texas 77030, USA
Xiao Wang
Xiao Wang
Professor of Statistics, Purdue University
Data ScienceAINonparametric StatisticsFunctional Data Analysis
Xinyuan Song
Xinyuan Song
Department of Statistics, The Chinese University of Hong Kong
Latent variable modelsStructural equation modelsBayesian methodsSurvival analysisStatistical computing
H
Hanwen Ning
School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China; Innovation and Talent Base for Digital Technology and Finance, Zhongnan University of Economics and Law
Guanyu Hu
Guanyu Hu
Michigan State University
Bayesian StatisticsBig DataSpatial StatisticsSports StatisticsSurvival Analysis