CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation

📅 2025-07-22

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

To address the generalization bottleneck in cross-domain few-shot segmentation (CD-FSS) caused by data scarcity and domain shift, this paper proposes the Composable Meta-Prompt (CMP) framework for the Segment Anything Model (SAM). To reduce SAM's reliance on manual prompts and improve its limited cross-domain ability, CMP introduces three modules: Reference Complement and Transformation (RCT) for semantic expansion, Composable Meta-Prompt Generation (CMPG) for automated meta-prompt synthesis, and Frequency-Aware Interaction (FAI) for mitigating domain discrepancy. The framework avoids fine-tuning SAM's backbone and achieves cross-domain transfer through lightweight prompt engineering alone. Evaluated on four cross-domain datasets, it reaches 71.8% mIoU in the 1-shot setting and 74.5% mIoU in the 5-shot setting, which the authors report as state-of-the-art. The authors position this as an efficient, generalizable approach to CD-FSS.

📝 Abstract

Cross-Domain Few-Shot Segmentation (CD-FSS) remains challenging due to limited data and domain shifts. Recent foundation models such as the Segment Anything Model (SAM) have shown remarkable zero-shot generalization in general segmentation tasks, making them a promising solution for few-shot scenarios. However, adapting SAM to CD-FSS faces two critical challenges: reliance on manual prompts and limited cross-domain ability. To address these, we propose the Composable Meta-Prompt (CMP) framework, which introduces three key modules: (i) the Reference Complement and Transformation (RCT) module for semantic expansion, (ii) the Composable Meta-Prompt Generation (CMPG) module for automated meta-prompt synthesis, and (iii) the Frequency-Aware Interaction (FAI) module for domain discrepancy mitigation. Evaluations across four cross-domain datasets demonstrate CMP's state-of-the-art performance, achieving 71.8% and 74.5% mIoU in 1-shot and 5-shot scenarios, respectively.
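For readers unfamiliar with the mIoU metric quoted above, the following is a minimal sketch of how intersection-over-union is commonly computed for binary masks and then averaged. It is an illustration only, not the paper's evaluation code: the function names `binary_iou` and `mean_iou` are hypothetical, masks are assumed to be nested lists of 0/1 values, and scoring two empty masks as 1.0 is an assumed convention. The exact averaging protocol behind the reported numbers (per episode, per class, per dataset) is not stated in this summary.

```python
from typing import Sequence

Mask = Sequence[Sequence[int]]  # rows of 0/1 pixel labels (assumed format)


def binary_iou(pred: Mask, target: Mask) -> float:
    """Intersection-over-union of two binary masks of equal shape."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    if union == 0:
        # Both masks are empty: treated as perfect agreement (assumed convention).
        return 1.0
    return inter / union


def mean_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Unweighted mean of per-pair IoU scores."""
    scores = [binary_iou(p, t) for p, t in zip(preds, targets)]
    if not scores:
        raise ValueError("mean_iou needs at least one prediction/target pair")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    preds = [[[1, 1], [0, 0]], [[1, 1], [1, 1]]]
    targets = [[[1, 0], [0, 0]], [[1, 1], [1, 1]]]
    print(mean_iou(preds, targets))
```

In this toy example the first pair overlaps on one of two foreground pixels (IoU 0.5) and the second pair matches exactly (IoU 1.0), so the mean is 0.75.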

Problem

Research questions and friction points this paper is trying to address.

Adapting SAM to the challenges of cross-domain few-shot segmentation

Reducing reliance on manual prompts in segmentation tasks

Mitigating domain shifts in few-shot segmentation scenarios

Innovation

Methods, ideas, or system contributions that make the work stand out.

RCT module enables semantic expansion

CMPG automates meta-prompt synthesis

FAI mitigates domain discrepancy

🔎 Similar Papers

TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation

2024-09-09, arXiv.org, Citations: 0
