Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation

📅 2023-02-14
🏛️ IEEE Transactions on Pattern Analysis and Machine Intelligence
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address domain shift in unsupervised domain adaptation for semantic segmentation—where labeled source and unlabeled target domains exhibit distributional divergence—this paper proposes a hard-sample-aware instance-adaptive pseudo-labeling framework. Our method comprises three key components: (1) an instance-adaptive pseudo-label selector that dynamically identifies high-quality pseudo-labels based on prediction confidence and local consistency; (2) hard-class-aware cross-image augmentation, which selectively mixes features in the embedding space to enhance discrimination of challenging classes; and (3) region-adaptive regularization, the first to extend consistency constraints beyond pseudo-labeled regions to unlabeled areas via dynamic spatial weighting. Evaluated on three standard benchmarks—GTA5→Cityscapes, SYNTHIA→Cityscapes, and Cityscapes→Oxford RobotCar—our approach consistently outperforms state-of-the-art methods. The implementation is publicly available.
📝 Abstract
The divergence between labeled training data and unlabeled testing data is a significant challenge for recent deep learning models. Unsupervised domain adaptation (UDA) attempts to solve such problem. Recent works show that self-training is a powerful approach to UDA. However, existing methods have difficulty in balancing the scalability and performance. In this paper, we propose a hard-aware instance adaptive self-training framework for UDA on the task of semantic segmentation. To effectively improve the quality and diversity of pseudo-labels, we develop a novel pseudo-label generation strategy with an instance adaptive selector. We further enrich the hard class pseudo-labels with inter-image information through a skillfully designed hard-aware pseudo-label augmentation. Besides, we propose the region-adaptive regularization to smooth the pseudo-label region and sharpen the non-pseudo-label region. For the non-pseudo-label region, consistency constraint is also constructed to introduce stronger supervision signals during model optimization. Our method is so concise and efficient that it is easy to be generalized to other UDA methods. Experiments on GTA5 Cityscapes, SYNTHIA Cityscapes, and Cityscapes Oxford RobotCar demonstrate the superior performance of our approach compared with the state-of-the-art methods. Our codes are available at https://github.com/bupt-ai-cz/HIAST.
Problem

Research questions and friction points this paper is trying to address.

Addresses domain gap in unsupervised semantic segmentation
Improves pseudo-label quality with adaptive strategies
Enhances hard class learning via inter-image augmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Instance adaptive selector for pseudo-label generation
Hard-aware pseudo-label augmentation for hard classes
Region-adaptive regularization for smoothing and sharpening
🔎 Similar Papers
No similar papers found.
C
Chuanglu Zhu
School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
K
Kebin Liu
School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Wenqi Tang
Wenqi Tang
School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Ke Mei
Ke Mei
Tencent Wechat
deep learningcomputer vision
J
Jiaqi Zou
School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
Tiejun Huang
Tiejun Huang
Professor,School of Computer Science, Peking University
Visual Information Processing