🤖 AI Summary
Existing CLIP-based foreground-background decomposition methods for few-shot out-of-distribution (OOD) detection employ a uniform background suppression strategy and overlook semantically ambiguous local regions within the foreground, thereby limiting performance. To address these limitations, this work proposes a plug-and-play framework that refines semantic control over image regions through three key components: foreground-background decomposition, entropy-weighted adaptive background suppression, and confusion-aware foreground correction based on local semantic similarity analysis. By moving beyond the conventional practice of treating all background regions equally and ignoring intra-foreground semantic ambiguities, the proposed method achieves significant performance gains over existing foreground-background approaches across multiple benchmarks, demonstrating its effectiveness and generalizability.
📝 Abstract
CLIP-based foreground-background (FG-BG) decomposition methods have demonstrated remarkable effectiveness in improving few-shot out-of-distribution (OOD) detection performance. However, existing approaches still suffer from several limitations. For background regions obtained from decomposition, existing methods adopt a uniform suppression strategy for all patches, overlooking the varying contributions of different patches to the prediction. For foreground regions, existing methods fail to adequately consider that some local patches may exhibit appearance or semantic similarity to other classes, which may mislead the training process. To address these issues, we propose a new plug-and-play framework. This framework consists of three core components: (1) a Foreground-Background Decomposition module, which follows previous FG-BG methods to separate an image into foreground and background regions; (2) an Adaptive Background Suppression module, which adaptively weights patch classification entropy; and (3) a Confusable Foreground Rectification module, which identifies and rectifies confusable foreground patches. Extensive experimental results demonstrate that the proposed plug-and-play framework significantly improves the performance of existing FG-BG decomposition methods. Code is available at: https://github.com/lounwb/FoBoR.