Reason-IAD: Knowledge-Guided Dynamic Latent Reasoning for Explainable Industrial Anomaly Detection

πŸ“… 2026-02-10
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing general-purpose multimodal large language models struggle to accurately identify category-specific fine-grained defects in industrial settings, resulting in insufficient detection accuracy and limited interpretability. To address this challenge, this work proposes a knowledge-guided dynamic latent-space reasoning framework that innovatively integrates retrieval-augmented category-specific textual knowledge, an entropy-driven optimizable implicit chain-of-thought token reasoning mechanism, and an information-theoretic dynamic visual patch injection strategy. This approach substantially enhances the model’s fine-grained understanding of industrial anomalies, outperforming state-of-the-art methods across multiple benchmarks while providing interpretable decision rationales.

Technology Category

Application Category

πŸ“ Abstract
Industrial anomaly detection demands precise reasoning over fine-grained defect patterns. However, existing multimodal large language models (MLLMs), pretrained on general-domain data, often struggle to capture category-specific anomalies, thereby limiting both detection accuracy and interpretability. To address these limitations, we propose Reason-IAD, a knowledge-guided dynamic latent reasoning framework for explainable industrial anomaly detection. Reason-IAD comprises two core components. First, a retrieval-augmented knowledge module incorporates category-specific textual descriptions into the model input, enabling context-aware reasoning over domain-specific defects. Second, an entropy-driven latent reasoning mechanism conducts iterative exploration within a compact latent space using optimizable latent think tokens, guided by an entropy-based reward that encourages confident and stable predictions. Furthermore, a dynamic visual injection strategy selectively incorporates the most informative image patches into the latent sequence, directing the reasoning process toward regions critical for anomaly detection. Extensive experimental results demonstrate that Reason-IAD consistently outperforms state-of-the-art methods. The code will be publicly available at https://github.com/chenpeng052/Reason-IAD.
Problem

Research questions and friction points this paper is trying to address.

Industrial Anomaly Detection
Multimodal Large Language Models
Fine-grained Defect Patterns
Explainability
Category-specific Anomalies
Innovation

Methods, ideas, or system contributions that make the work stand out.

knowledge-guided reasoning
latent reasoning
entropy-driven optimization
retrieval-augmented module
dynamic visual injection
πŸ”Ž Similar Papers
2024-05-02International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrowCitations: 0
P
Peng Chen
School of Cyber Science and Technology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Chao Huang
Chao Huang
Assistant Professor of AI & Data Science, University of Hong Kong
LLM AgentFoundation ModelGraph Machine LearningSpatio-Temporal Data MiningRecommendation
Yunkang Cao
Yunkang Cao
Hunan University
Visual Anomaly DetectionIndustrial Foundation ModelEmbodied Intelligence
C
Chengliang Liu
Department of Computer and Information Science, University of Macau, Macau, China
W
Wenqiang Wang
School of Cyber Science and Technology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
M
Mingbo Yang
School of Cyber Science and Technology, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Li Shen
Li Shen
Associate Professor, Sun Yat-sen University
Machine LearningOptimization
Wenqi Ren
Wenqi Ren
Sun Yat-sen University
Computer VisionImage ProcessingArtificial IntelligenceImage Restoration
Xiaochun Cao
Xiaochun Cao
Sun Yat-sen University
Computer VisionArtificial IntelligenceMultimediaMachine Learning