RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency

📅 2024-11-22
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses alignment failure in cross-modal registration of spatial transcriptomics (ST) data and histopathological images, caused by spatial distortions, modality heterogeneity, and the high-dimensional sparsity of gene expression data. We propose a ranking-consistency-based gene–image representation learning framework. Our key contributions are: (1) a multi-scale ranking alignment loss that enables robust and interpretable cross-modal geometric matching; and (2) a self-supervised teacher–student distillation architecture, where the teacher network models low-noise representations to suppress noise inherent in gene expression measurements. Evaluated on seven public ST datasets, our method significantly improves performance across downstream tasks—including gene expression prediction, tissue section classification, and survival analysis—while achieving superior alignment accuracy and outperforming state-of-the-art methods on all evaluated metrics.

Technology Category

Application Category

📝 Abstract
Spatial transcriptomics (ST) provides essential spatial context by mapping gene expression within tissue, enabling detailed study of cellular heterogeneity and tissue organization. However, aligning ST data with histology images poses challenges due to inherent spatial distortions and modality-specific variations. Existing methods largely rely on direct alignment, which often fails to capture complex cross-modal relationships. To address these limitations, we propose a novel framework that aligns gene and image features using a ranking-based alignment loss, preserving relative similarity across modalities and enabling robust multi-scale alignment. To further enhance the alignment's stability, we employ self-supervised knowledge distillation with a teacher-student network architecture, effectively mitigating disruptions from high dimensionality, sparsity, and noise in gene expression data. Extensive experiments on seven public datasets that encompass gene expression prediction, slide-level classification, and survival analysis demonstrate the efficacy of our method, showing improved alignment and predictive performance over existing methods.
Problem

Research questions and friction points this paper is trying to address.

Aligning spatial transcriptomics with histology images despite distortions
Capturing complex gene-image cross-modal relationships effectively
Mitigating high dimensionality and noise in gene expression data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Rank-based alignment loss for cross-modal consistency
Self-supervised distillation with teacher-student architecture
Multi-scale gene-image feature alignment enhancement
🔎 Similar Papers
No similar papers found.
W
Wentao Huang
Stony Brook University, NY , USA
Meilong Xu
Meilong Xu
Stony Brook University
Machine LearningComputer VisionTopological Data Analysis
X
Xiaoling Hu
Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital and Harvard Medical School, MA, USA
Shahira Abousamra
Shahira Abousamra
Stony Brook University
A
Aniruddha Ganguly
Stony Brook University, NY , USA
S
S. Kapse
Stony Brook University, NY , USA
Alisa Yurovsky
Alisa Yurovsky
Stony Brook Universty
Bioinformatics
Prateek Prasanna
Prateek Prasanna
Associate Professor, Stony Brook University
Medical VisionBiomedical image analysisRadiogenomicsRadiomicsComputational Pathology
T
T. Kurç
Stony Brook University, NY , USA
J
J. Saltz
Stony Brook University, NY , USA
M
Michael L. Miller
Department of Pathology and Cell Biology, Columbia University, NY , USA
C
Chao Chen
Stony Brook University, NY , USA