Divide-and-Conquer Decoupled Network for Cross-Domain Few-Shot Segmentation

📅 2025-11-11

🤖 AI Summary

Cross-domain few-shot segmentation (CD-FSS) generalizes poorly and adapts slowly to novel domains because encoder features entangle domain-relevant and category-relevant information. To address this, we propose a feature disentanglement approach in two parts: first, adversarial and contrastive learning decouple backbone features into category-relevant private representations and domain-relevant shared representations; second, a matrix-guided dynamic fusion mechanism adaptively integrates base, shared, and private features under spatial guidance to preserve structural coherence. During fine-tuning, a cross-adaptive modulation step lets shared features guide private features before fusion. Evaluated on four challenging datasets, the method outperforms existing CD-FSS methods and sets a new state of the art for cross-domain generalization and few-shot adaptation.

📝 Abstract

Cross-domain few-shot segmentation (CD-FSS) aims to tackle the dual challenge of recognizing novel classes and adapting to unseen domains with limited annotations. However, encoder features often entangle domain-relevant and category-relevant information, limiting both generalization and rapid adaptation to new domains. To address this issue, we propose a Divide-and-Conquer Decoupled Network (DCDNet). In the training stage, to tackle the feature entanglement that impedes cross-domain generalization and rapid adaptation, we propose the Adversarial-Contrastive Feature Decomposition (ACFD) module. It decouples backbone features into category-relevant private and domain-relevant shared representations via contrastive learning and adversarial learning. Then, to mitigate the potential degradation caused by the disentanglement, the Matrix-Guided Dynamic Fusion (MGDF) module adaptively integrates base, shared, and private features under spatial guidance, maintaining structural coherence. In addition, in the fine-tuning stage, to enhance model generalization, the Cross-Adaptive Modulation (CAM) module is placed before the MGDF, where shared features guide private features via modulation, ensuring effective integration of domain-relevant information. Extensive experiments on four challenging datasets show that DCDNet outperforms existing CD-FSS methods, setting a new state of the art for cross-domain generalization and few-shot adaptation.
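
The abstract does not state which contrastive objective ACFD uses. As a rough illustration of the contrastive half of the decomposition only, the sketch below implements a generic InfoNCE loss in NumPy. Treating private features of the same category as the positive pair and other features as negatives is an assumption made here, and the function name `info_nce` and the `temperature` value are hypothetical rather than taken from the paper.

```python
import numpy as np

def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss for a single anchor vector (illustrative only).

    anchor, positive: arrays of shape (d,)
    negatives:        array of shape (n, d)
    """
    def unit(x):
        # L2-normalise along the feature axis so dot products are cosine similarities.
        return x / np.linalg.norm(x, axis=-1, keepdims=True)

    a, p, n = unit(anchor), unit(positive), unit(negatives)
    # Similarity to the positive sits at index 0, followed by the negatives.
    logits = np.concatenate(([a @ p], n @ a)) / temperature
    logits -= logits.max()  # subtract the max for numerical stability
    log_prob = logits - np.log(np.exp(logits).sum())
    return -log_prob[0]  # negative log-probability of picking the positive
```

The loss is small when the anchor is closer to its positive than to every negative, which is the behaviour a decoupling objective of this kind would reward. The adversarial half is not sketched here because it depends on training details the abstract does not give.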

Problem

Research questions and friction points this paper is trying to address.

Decouple entangled domain and category features

Enable cross-domain generalization with limited annotations

Maintain structural coherence during feature disentanglement

Innovation

Methods, ideas, or system contributions that make the work stand out.

Decomposes features via adversarial-contrastive learning

Fuses features dynamically with matrix guidance

Modulates private features using shared representations
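
The last two points can be pictured with a small NumPy sketch. It is not the paper's CAM or MGDF design, which this summary does not specify: the modulation is assumed to be a FiLM-style per-channel scale and shift, the fusion is assumed to be a per-pixel softmax blend of the three branches, and `modulate`, `fuse`, `w_gamma`, `w_beta`, and `scores` are all hypothetical names.

```python
import numpy as np

def modulate(private, shared, w_gamma, w_beta):
    """Assumed FiLM-style modulation: shared features set a scale and shift for private ones.

    private, shared: (C, H, W) feature maps
    w_gamma, w_beta: (C, C) hypothetical projection weights
    """
    context = shared.mean(axis=(1, 2))        # (C,) global average pooling
    gamma = 1.0 + np.tanh(w_gamma @ context)  # per-channel scale centred on 1
    beta = w_beta @ context                   # per-channel shift
    return gamma[:, None, None] * private + beta[:, None, None]

def fuse(base, shared, private, scores):
    """Assumed spatially guided fusion: per-pixel softmax weights over three branches.

    base, shared, private: (C, H, W) feature maps
    scores:                (3, H, W) unnormalised per-pixel branch scores
    """
    scores = scores - scores.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum(axis=0, keepdims=True)
    branches = np.stack([base, shared, private])         # (3, C, H, W)
    # Broadcast the weights over the channel axis and sum over the branch axis.
    return (weights[:, None] * branches).sum(axis=0)     # (C, H, W)
```

With zero projection weights `modulate` leaves the private features unchanged, and with uniform scores `fuse` reduces to a plain average of the three branches, so both functions degrade to simple baselines.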

🔎 Similar Papers

TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation