🤖 AI Summary
This work addresses the challenge of directly applying Segment Anything Model (SAM) to camouflaged object detection (COD), where SAM’s generic priors fail to capture subtle camouflage patterns. We propose a lightweight, interpretable keypoint-guided framework: (1) a multi-scale probabilistic prompt localization network (PPT-Net) that predicts salient keypoints; and (2) a contrastive keypoint selection (KPS) algorithm that identifies discriminative positive/negative prompts for zero-shot adaptation of SAM. By integrating multi-scale feature encoding with point-level existence probability modeling, our method enhances SAM’s fine-grained segmentation capability for COD. Evaluated on three standard COD benchmarks across six metrics, our approach consistently outperforms state-of-the-art methods. Results demonstrate SAM’s effective, generalizable, and interpretable “plug-and-play” applicability to COD—establishing a novel paradigm for adapting foundation models to low-resource, high-difficulty vision tasks.
📝 Abstract
Big model has emerged as a new research paradigm that can be applied to various down-stream tasks with only minor effort for domain adaption. Correspondingly, this study tackles Camouflaged Object Detection (COD) leveraging the Segment Anything Model (SAM). The previous studies declared that SAM is not workable for COD but this study reveals that SAM works if promoted properly, for which we devise a new framework to render point promotions: First, we develop the Promotion Point Targeting Network (PPT-net) to leverage multi-scale features in predicting the probabilities of camouflaged objects' presences at given candidate points over the image. Then, we develop a key point selection (KPS) algorithm to deploy both positive and negative point promotions contrastively to SAM to guide the segmentation. It is the first work to facilitate big model for COD and achieves plausible results experimentally over the existing methods on 3 data sets under 6 metrics. This study demonstrates an off-the-shelf methodology for COD by leveraging SAM, which gains advantage over designing professional models from scratch, not only in performance, but also in turning the problem to a less challenging task, that is, seeking informative but not exactly precise promotions.