Dual form Complementary Masking for Domain-Adaptive Image Segmentation

πŸ“… 2025-07-16
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing unsupervised domain adaptation (UDA) methods treat masked image modeling (MIM) merely as input perturbation, lacking theoretical grounding and thus limiting its potential for feature extraction and representation learning. To address this, we propose MaskTwinsβ€”a novel framework that, for the first time, reformulates MIM from the perspective of sparse signal recovery. We introduce complementary mask dualities and theoretically prove that they enhance domain-invariant feature learning and explicitly model cross-domain structural consistency. Our method employs a dual-branch network that jointly optimizes complementary mask reconstruction and feature alignment, enabling end-to-end UDA for semantic segmentation without requiring pretraining. Extensive experiments on both natural and biomedical image segmentation benchmarks demonstrate significant improvements over state-of-the-art UDA baselines, validating the generalizability and effectiveness of MaskTwins.

Technology Category

Application Category

πŸ“ Abstract
Recent works have correlated Masked Image Modeling (MIM) with consistency regularization in Unsupervised Domain Adaptation (UDA). However, they merely treat masking as a special form of deformation on the input images and neglect the theoretical analysis, which leads to a superficial understanding of masked reconstruction and insufficient exploitation of its potential in enhancing feature extraction and representation learning. In this paper, we reframe masked reconstruction as a sparse signal reconstruction problem and theoretically prove that the dual form of complementary masks possesses superior capabilities in extracting domain-agnostic image features. Based on this compelling insight, we propose MaskTwins, a simple yet effective UDA framework that integrates masked reconstruction directly into the main training pipeline. MaskTwins uncovers intrinsic structural patterns that persist across disparate domains by enforcing consistency between predictions of images masked in complementary ways, enabling domain generalization in an end-to-end manner. Extensive experiments verify the superiority of MaskTwins over baseline methods in natural and biological image segmentation. These results demonstrate the significant advantages of MaskTwins in extracting domain-invariant features without the need for separate pre-training, offering a new paradigm for domain-adaptive segmentation.
Problem

Research questions and friction points this paper is trying to address.

Enhancing feature extraction in domain-adaptive image segmentation
Theoretical analysis of masked reconstruction in UDA
Integrating masked reconstruction for domain-invariant features
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual complementary masks enhance feature extraction
MaskTwins integrates masked reconstruction directly
Enforces consistency between complementary masked predictions
πŸ”Ž Similar Papers
No similar papers found.
J
Jiawen Wang
University of Science and Technology of China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center
Yinda Chen
Yinda Chen
University of Science and Technology of China, Xiamen University
Machine Learning TheorySelf-supervised LearningImage Compression
X
Xiaoyu Liu
University of Science and Technology of China
Che Liu
Che Liu
Imperial College London
Multimodal learningAI4Medicine
D
Dong Liu
University of Science and Technology of China
J
Jianqing Gao
iFLYTEK CO., LTD.
Zhiwei Xiong
Zhiwei Xiong
University of Science and Technology of China
computational photographybiomedical image analysis