Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training

πŸ“… 2023-09-21
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Medical image segmentation suffers from scarce annotated data, while existing contrastive learning methods are largely confined to image-level representations and lack effective encoder-decoder co-training. To address this, we propose MACL, a Multi-level Asymmetric Contrastive Learning frameworkβ€”the first to jointly pre-train encoder and decoder by integrating feature-level, image-level, and pixel-level representations. Key innovations include: (1) a multi-level joint contrastive loss; (2) an asymmetric dual-branch network architecture; (3) voxel-wise positive/negative sample construction and cross-scale feature alignment; and (4) seamless compatibility with U-Net-based backbones. MACL outperforms 11 state-of-the-art contrastive methods across eight medical imaging datasets. With only 10% labeled data, it achieves Dice score improvements of 1.72–7.87% on four benchmarks (e.g., ACDC). Moreover, it consistently delivers SOTA performance when integrated into five distinct U-Net variants, demonstrating strong generalization and architectural flexibility.
πŸ“ Abstract
Medical image segmentation is a fundamental yet challenging task due to the arduous process of acquiring large volumes of high-quality labeled data from experts. Contrastive learning offers a promising but still problematic solution to this dilemma. Firstly existing medical contrastive learning strategies focus on extracting image-level representation, which ignores abundant multi-level representations. Furthermore they underutilize the decoder either by random initialization or separate pre-training from the encoder, thereby neglecting the potential collaboration between the encoder and decoder. To address these issues, we propose a novel multi-level asymmetric contrastive learning framework named MACL for volumetric medical image segmentation pre-training. Specifically, we design an asymmetric contrastive learning structure to pre-train encoder and decoder simultaneously to provide better initialization for segmentation models. Moreover, we develop a multi-level contrastive learning strategy that integrates correspondences across feature-level, image-level, and pixel-level representations to ensure the encoder and decoder capture comprehensive details from representations of varying scales and granularities during the pre-training phase. Finally, experiments on 8 medical image datasets indicate our MACL framework outperforms existing 11 contrastive learning strategies. i.e. Our MACL achieves a superior performance with more precise predictions from visualization figures and 1.72%, 7.87%, 2.49% and 1.48% Dice higher than previous best results on ACDC, MMWHS, HVSMR and CHAOS with 10% labeled data, respectively. And our MACL also has a strong generalization ability among 5 variant U-Net backbones. Our code will be released at https://github.com/stevezs315/MACL.
Problem

Research questions and friction points this paper is trying to address.

Enhances volumetric medical image segmentation
Improves encoder-decoder collaboration in pre-training
Integrates multi-level representations in contrastive learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Asymmetric contrastive learning structure
Multi-level contrastive learning strategy
Simultaneous encoder-decoder pre-training
πŸ”Ž Similar Papers
No similar papers found.
Shuang Zeng
Shuang Zeng
Peking University, Georgia Institute of Technology
Self-supervised Contrastive LearningMedical Image SegmentationSuperpixelLarge Language Model
L
Lei Zhu
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing 100191, China; Department of Biomedical Engineering, Peking University, Beijing 100871, China; National Biomedical Imaging Center, Peking University, Beijing 100871, China; Institute of Biomedical Engineering, Shenzhen Bay Laboratory, Shenzhen 5181071, China
X
Xinliang Zhang
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing 100191, China; Department of Biomedical Engineering, Peking University, Beijing 100871, China; National Biomedical Imaging Center, Peking University, Beijing 100871, China; Institute of Biomedical Engineering, Shenzhen Bay Laboratory, Shenzhen 5181071, China
Q
Qian Chen
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing 100191, China; Department of Biomedical Engineering, Peking University, Beijing 100871, China; National Biomedical Imaging Center, Peking University, Beijing 100871, China; Institute of Biomedical Engineering, Shenzhen Bay Laboratory, Shenzhen 5181071, China
Hangzhou He
Hangzhou He
PhD student, Peking University
ExplainabilityMedical Image AnalysisTrustworthy AI
Lujia Jin
Lujia Jin
Peking University
Image DenoisingImage Super ResolutionMedical Image ProcessingDeep Learning
Z
Zifeng Tian
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing 100191, China; Department of Biomedical Engineering, Peking University, Beijing 100871, China; National Biomedical Imaging Center, Peking University, Beijing 100871, China; Institute of Biomedical Engineering, Shenzhen Bay Laboratory, Shenzhen 5181071, China
Qiushi Ren
Qiushi Ren
Peking University
Z
Zhaoheng Xie
Institute of Medical Technology, Peking University Health Science Center, Peking University, Beijing 100191, China; Department of Biomedical Engineering, Peking University, Beijing 100871, China; National Biomedical Imaging Center, Peking University, Beijing 100871, China; Institute of Biomedical Engineering, Shenzhen Bay Laboratory, Shenzhen 5181071, China
Yanye Lu
Yanye Lu
Peking University
Medical Imaging/Deep Learning/Machine Learning