Exploiting Inherent Class Label: Towards Robust Scribble Supervised Semantic Segmentation

📅 2025-03-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Weakly supervised semantic segmentation with scribbles faces two key challenges: insufficient supervision due to sparse annotations and inconsistent predictions arising from diverse scribble styles. To address these, we propose the Class-Driven Scribble Augmentation Network (CD-SAN), introducing two novel components: a localization correction module to refine object region localization, and a distance-aware module to explicitly model spatial relationships among pixels. We further design a scribble simulation algorithm and construct ScribbleCOCO/ScribbleCityscapes—the first benchmark supporting multi-style scribble evaluation. Our method integrates pseudo-label optimization, noise-robust training, and modular feature localization. Extensive experiments demonstrate significant improvements in segmentation accuracy and prediction stability under multi-style scribble settings, surpassing existing state-of-the-art methods. The code and datasets will be publicly released.

Technology Category

Application Category

📝 Abstract
Scribble-based weakly supervised semantic segmentation leverages only a few annotated pixels as labels to train a segmentation model, presenting significant potential for reducing the human labor involved in the annotation process. This approach faces two primary challenges: first, the sparsity of scribble annotations can lead to inconsistent predictions due to limited supervision; second, the variability in scribble annotations, reflecting differing human annotator preferences, can prevent the model from consistently capturing the discriminative regions of objects, potentially leading to unstable predictions. To address these issues, we propose a holistic framework, the class-driven scribble promotion network, for robust scribble-supervised semantic segmentation. This framework not only utilizes the provided scribble annotations but also leverages their associated class labels to generate reliable pseudo-labels. Within the network, we introduce a localization rectification module to mitigate noisy labels and a distance perception module to identify reliable regions surrounding scribble annotations and pseudo-labels. In addition, we introduce new large-scale benchmarks, ScribbleCOCO and ScribbleCityscapes, accompanied by a scribble simulation algorithm that enables evaluation across varying scribble styles. Our method demonstrates competitive performance in both accuracy and robustness, underscoring its superiority over existing approaches. The datasets and the codes will be made publicly available.
Problem

Research questions and friction points this paper is trying to address.

Addresses inconsistency in scribble-based semantic segmentation predictions.
Mitigates variability in scribble annotations for stable object recognition.
Proposes a framework to generate reliable pseudo-labels from scribble annotations.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Class-driven scribble promotion network framework
Localization rectification module for noise reduction
Distance perception module for reliable region identification
🔎 Similar Papers
No similar papers found.
X
Xinliang Zhang
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing, and also with Department of Biomedical Engineering, Peking University, Beijing, China, and also with National Biomedical Imaging Center, Peking University, Beijing 100871, China, and also with Peking University Shenzhen Graduate School, Shenzhen 518055, China
L
Lei Zhu
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing, and also with Department of Biomedical Engineering, Peking University, Beijing, China, and also with National Biomedical Imaging Center, Peking University, Beijing 100871, China, and also with Peking University Shenzhen Graduate School, Shenzhen 518055, China
Shuang Zeng
Shuang Zeng
Peking University, Georgia Institute of Technology
Self-supervised Contrastive LearningMedical Image SegmentationSuperpixelLarge Language Model
Hangzhou He
Hangzhou He
PhD student, Peking University
ExplainabilityMedical Image AnalysisTrustworthy AI
O
Ourui Fu
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing, and also with Department of Biomedical Engineering, Peking University, Beijing, China, and also with National Biomedical Imaging Center, Peking University, Beijing 100871, China, and also with Peking University Shenzhen Graduate School, Shenzhen 518055, China
Zhengjian Yao
Zhengjian Yao
Peking University
computer visiongenerative model
Z
Zhaoheng Xie
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing, and also with Department of Biomedical Engineering, Peking University, Beijing, China, and also with National Biomedical Imaging Center, Peking University, Beijing 100871, China, and also with Peking University Shenzhen Graduate School, Shenzhen 518055, China
Yanye Lu
Yanye Lu
Peking University
Medical Imaging/Deep Learning/Machine Learning