๐ค AI Summary
Spatial transcriptomics (ST) data pose modeling challenges due to discreteness, ultra-high dimensionality, sparsity, and highly skewed gene expression distributions. To address these, we propose the first graph-augmented autoencoder-guided implicit neural representation (INR) framework, which achieves continuous, compact modeling via structure-aware embedding learning. We further introduce a novel regression-to-classification loss to mitigate fitting bias induced by the heavy-tailed expression distribution. Our method consistently outperforms existing INR variants and state-of-the-art methods across multiple ST platforms. It achieves significant improvements in numerical fidelity (PSNR/SSIM), geneโspatial statistical correlation (Moranโs I, Spearman correlation coefficient), and biological pathway consistency. Crucially, it enhances signal fidelity for key genes, thereby enabling more reliable downstream spatial decomposition and functional inference.
๐ Abstract
Spatial Transcriptomics (ST) is a method that captures gene expression profiles aligned with spatial coordinates. The discrete spatial distribution and the super-high dimensional sequencing results make ST data challenging to be modeled effectively. In this paper, we manage to model ST in a continuous and compact manner by the proposed tool, SUICA, empowered by the great approximation capability of Implicit Neural Representations (INRs) that can enhance both the spatial density and the gene expression. Concretely within the proposed SUICA, we incorporate a graph-augmented Autoencoder to effectively model the context information of the unstructured spots and provide informative embeddings that are structure-aware for spatial mapping. We also tackle the extremely skewed distribution in a regression-by-classification fashion and enforce classification-based loss functions for the optimization of SUICA. By extensive experiments of a wide range of common ST platforms under varying degradations, SUICA outperforms both conventional INR variants and SOTA methods regarding numerical fidelity, statistical correlation, and bio-conservation. The prediction by SUICA also showcases amplified gene signatures that enriches the bio-conservation of the raw data and benefits subsequent analysis. The code is available at https://github.com/Szym29/SUICA.