ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

📅 2025-07-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Subcellular structures exhibit high morphological and spatial variability, causing existing segmentation models—including SAM—to suffer from label bias and insufficient fine-grained detail capture, thereby limiting generalization. To address this, we propose ScSAM: a framework that synergistically integrates SAM’s strong semantic representations with cell-structure priors learned via MAE pretraining. We introduce a feature alignment fusion module to harmonize these complementary cues and design a cosine-similarity-based class prompt encoder that explicitly models inter-class relationships to mitigate training bias induced by data imbalance. Extensive experiments across multiple subcellular image datasets demonstrate that ScSAM consistently outperforms state-of-the-art methods, achieving significant improvements in both segmentation accuracy and cross-dataset generalization.

Technology Category

Application Category

📝 Abstract
The significant morphological and distributional variability among subcellular components poses a long-standing challenge for learning-based organelle segmentation models, significantly increasing the risk of biased feature learning. Existing methods often rely on single mapping relationships, overlooking feature diversity and thereby inducing biased training. Although the Segment Anything Model (SAM) provides rich feature representations, its application to subcellular scenarios is hindered by two key challenges: (1) The variability in subcellular morphology and distribution creates gaps in the label space, leading the model to learn spurious or biased features. (2) SAM focuses on global contextual understanding and often ignores fine-grained spatial details, making it challenging to capture subtle structural alterations and cope with skewed data distributions. To address these challenges, we introduce ScSAM, a method that enhances feature robustness by fusing pre-trained SAM with Masked Autoencoder (MAE)-guided cellular prior knowledge to alleviate training bias from data imbalance. Specifically, we design a feature alignment and fusion module to align pre-trained embeddings to the same feature space and efficiently combine different representations. Moreover, we present a cosine similarity matrix-based class prompt encoder to activate class-specific features to recognize subcellular categories. Extensive experiments on diverse subcellular image datasets demonstrate that ScSAM outperforms state-of-the-art methods.
Problem

Research questions and friction points this paper is trying to address.

Debiasing subcellular segmentation from morphological variability
Overcoming biased training in organelle feature learning
Enhancing fine-grained spatial details in SAM for subcellular images
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fuses SAM with MAE-guided cellular prior knowledge
Aligns embeddings via feature alignment module
Uses cosine similarity for class-specific features
🔎 Similar Papers
No similar papers found.
Bo Fang
Bo Fang
University of Texas at Arlington
fault tolerancedistributed systemquantum computingHPC
J
Jianan Fan
School of Computer Science, University of Sydney, Australia
Dongnan Liu
Dongnan Liu
The University of Sydney
computer visionlarge language modelmedical image analysis
Hang Chang
Hang Chang
Lawrence Berkeley National Laboratory
Artificial Intelligence/Machine LearningComputer VisionBioinformaticsCancer Research
G
Gerald J. Shami
School of Medical Sciences (Molecular and Cellular Biomedicine), University of Sydney, Australia and Australian Centre for Microscopy and Microanalysis, University of Sydney, Australia
F
Filip Braet
School of Medical Sciences (Molecular and Cellular Biomedicine), University of Sydney, Australia and Australian Centre for Microscopy and Microanalysis, University of Sydney, Australia
Weidong Cai
Weidong Cai
Clinical Associate Professor, Stanford University School of Medicine
functional neuroimagingmachine learningcognitivedevelopmentalclinical neuroscience