🤖 AI Summary
High cost and scarcity of spatial transcriptomics (ST) data hinder broad adoption. To address this, we propose a novel cross-modal framework integrating masked image modeling and contrastive learning, enabling high-accuracy prediction of spatially resolved gene expression profiles directly from low-cost whole-slide images (WSIs)—the first such approach. Our method jointly leverages multi-scale histological feature extraction, spatially aware representation learning, and cross-modal contrastive alignment to preserve both per-gene prediction fidelity and inter-gene correlation structure. Evaluated across six disease cohorts, it achieves average improvements of 6.27%, 6.11%, and 11.26% in Pearson correlation coefficient (PCC) for highly expressed, highly variable, and biomarker genes, respectively. The model demonstrates strong few-shot generalization and successfully enables cancer tissue spatial localization. This work establishes a new paradigm for cost-effective, high-throughput spatial functional profiling.
📝 Abstract
Spatial transcriptomics is a technology that captures gene expression levels at different spatial locations, widely used in tumor microenvironment analysis and molecular profiling of histopathology, providing valuable insights into resolving gene expression and clinical diagnosis of cancer. Due to the high cost of data acquisition, large-scale spatial transcriptomics data remain challenging to obtain. In this study, we develop a contrastive learning-based deep learning method to predict spatially resolved gene expression from whole-slide images. Evaluation across six different disease datasets demonstrates that, compared to existing studies, our method improves Pearson Correlation Coefficient (PCC) in the prediction of highly expressed genes, highly variable genes, and marker genes by 6.27%, 6.11%, and 11.26% respectively. Further analysis indicates that our method preserves gene-gene correlations and applies to datasets with limited samples. Additionally, our method exhibits potential in cancer tissue localization based on biomarker expression.