Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation

📅 2025-04-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing zero-shot anomaly segmentation models rely on fixed textual prompts, resulting in poor cross-industrial adaptability. To address this, we propose an image-aware dynamic prompt generation framework that pioneers the integration of vision annotation models with large language models (LLMs) for prompt synthesis. Given an input image, our method automatically extracts semantic attributes and generates context-aware, scene-adaptive textual prompts to guide zero-shot segmentation. Crucially, it eliminates manual prompt engineering while significantly enhancing model generalization. Evaluated on multiple industrial anomaly segmentation benchmarks, our approach achieves up to a 10% improvement in F1-max over prior methods. This advancement substantially improves robustness and practicality in dynamic, unstructured industrial environments.

Technology Category

Application Category

📝 Abstract
Anomaly segmentation is essential for industrial quality, maintenance, and stability. Existing text-guided zero-shot anomaly segmentation models are effective but rely on fixed prompts, limiting adaptability in diverse industrial scenarios. This highlights the need for flexible, context-aware prompting strategies. We propose Image-Aware Prompt Anomaly Segmentation (IAP-AS), which enhances anomaly segmentation by generating dynamic, context-aware prompts using an image tagging model and a large language model (LLM). IAP-AS extracts object attributes from images to generate context-aware prompts, improving adaptability and generalization in dynamic and unstructured industrial environments. In our experiments, IAP-AS improves the F1-max metric by up to 10%, demonstrating superior adaptability and generalization. It provides a scalable solution for anomaly segmentation across industries
Problem

Research questions and friction points this paper is trying to address.

Enhances anomaly segmentation with dynamic prompts
Improves adaptability in diverse industrial scenarios
Generates context-aware prompts using image attributes
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic context-aware prompts generation
Image tagging and LLM for prompts
Improves adaptability in industrial scenarios
🔎 Similar Papers
No similar papers found.
S
SoYoung Park
Department of Computer Science and Engineering, Chungnam National University, Daejeon, South Korea
Hyewon Lee
Hyewon Lee
Department of Computer Science and Engineering, Chungnam National University
GNN
M
Mingyu Choi
Department of Computer Science and Engineering, Chungnam National University, Daejeon, South Korea
Seunghoon Han
Seunghoon Han
Chungnam National University
Graph Neural Networks
Jong-Ryul Lee
Jong-Ryul Lee
Chungnam National University (CNU)
Graph TheoryDeep LearningModel Compression
S
Sungsu Lim
Department of Computer Science and Engineering, Chungnam National University, Daejeon, South Korea
T
Tae-Ho Kim
Nota Inc., Seoul, South Korea