From Few to More: Scribble-based Medical Image Segmentation via Masked Context Modeling and Continuous Pseudo Labels

📅 2024-08-23
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address performance degradation in weakly supervised medical image segmentation caused by sparse scribble annotations, this paper proposes MaCo, a “from-few-to-many” progressive learning framework. Methodologically, we introduce Mask Context Modeling (MCM), a novel attention mechanism that jointly leverages distance maps and an exponential decay function to generate continuous pseudo-labels (CPLs) with pixel-wise semantic confidence—replacing error-prone hard pseudo-labels. Additionally, we incorporate a self-supervised context consistency constraint, eliminating reliance on auxiliary tasks. This enables robust, continuous modeling of pixel-level semantic confidence. Evaluated on three public medical imaging benchmarks, MaCo consistently outperforms existing scribble-supervised methods, establishing new state-of-the-art results and significantly enhancing model adaptability to annotation sparsity.

Technology Category

Application Category

📝 Abstract
Scribble-based weakly supervised segmentation techniques offer comparable performance to fully supervised methods while significantly reducing annotation costs, making them an appealing alternative. Existing methods often rely on auxiliary tasks to enforce semantic consistency and use hard pseudo labels for supervision. However, these methods often overlook the unique requirements of models trained with sparse annotations. Since the model must predict pixel-wise segmentation maps with limited annotations, the ability to handle varying levels of annotation richness is critical. In this paper, we adopt the principle of `from few to more' and propose MaCo, a weakly supervised framework designed for medical image segmentation. MaCo employs masked context modeling (MCM) and continuous pseudo labels (CPL). MCM uses an attention-based masking strategy to disrupt the input image, compelling the model's predictions to remain consistent with those of the original image. CPL converts scribble annotations into continuous pixel-wise labels by applying an exponential decay function to distance maps, resulting in continuous maps that represent the confidence of each pixel belonging to a specific category, rather than using hard pseudo labels. We evaluate MaCo against other weakly supervised methods using three public datasets. The results indicate that MaCo outperforms competing methods across all datasets, setting a new record in weakly supervised medical image segmentation.
Problem

Research questions and friction points this paper is trying to address.

Improves scribble-based medical image segmentation accuracy
Addresses sparse annotation challenges in segmentation models
Enhances semantic consistency without hard pseudo labels
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Masked Context Modeling for semantic consistency
Applies Continuous Pseudo Labels from scribble annotations
Leverages attention-based masking for robust predictions
🔎 Similar Papers
No similar papers found.
Z
Zhisong Wang
National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science and Engineering, Northwestern Polytechnical University, China
Y
Yiwen Ye
National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science and Engineering, Northwestern Polytechnical University, China
Ziyang Chen
Ziyang Chen
Peking University
Quantum key distributionQuantum random number generation
M
Minglei Shu
Shandong Artificial Intelligence Institute, Qilu University of Technology (Shandong Academy of Sciences), China
Y
Yong Xia
National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, School of Computer Science and Engineering, Northwestern Polytechnical University, China; Ningbo Institute of Northwestern Polytechnical University, Ningbo 315048, China; Research & Development Institute of Northwestern Polytechnical University in Shenzhen, Shenzhen 518057, China