Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection

πŸ“… 2025-02-17
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
In industrial quality inspection, the scarcity of genuine anomalous samples limits supervised anomaly detection, while existing generative methods often produce physically implausible pseudo-anomalies, leading to high false-positive rates. To address this, we propose ComGENβ€”a component-aware, unsupervised logical anomaly generation framework. ComGEN models part structures via multi-scale component disentanglement and explicitly encodes intrinsic logical constraints among components through text-component-aligned attention-guided residual mapping, enabling interpretable anomaly synthesis without real anomaly samples. It further incorporates iterative reference matching and unsupervised generative editing to enhance generation plausibility and generalizability. Evaluated on MVTecLOCO, ComGEN achieves 91.2% AUROC. Extensive validation on a diesel engine production line and MVTecAD demonstrates significant reductions in false positives and consistent improvements in downstream anomaly detection performance.

Technology Category

Application Category

πŸ“ Abstract
Anomaly detection is critical in industrial manufacturing for ensuring product quality and improving efficiency in automated processes. The scarcity of anomalous samples limits traditional detection methods, making anomaly generation essential for expanding the data repository. However, recent generative models often produce unrealistic anomalies increasing false positives, or require real-world anomaly samples for training. In this work, we treat anomaly generation as a compositional problem and propose ComGEN, a component-aware and unsupervised framework that addresses the gap in logical anomaly generation. Our method comprises a multi-component learning strategy to disentangle visual components, followed by subsequent generation editing procedures. Disentangled text-to-component pairs, revealing intrinsic logical constraints, conduct attention-guided residual mapping and model training with iteratively matched references across multiple scales. Experiments on the MVTecLOCO dataset confirm the efficacy of ComGEN, achieving the best AUROC score of 91.2%. Additional experiments on the real-world scenario of Diesel Engine and widely-used MVTecAD dataset demonstrate significant performance improvements when integrating simulated anomalies generated by ComGEN into automated production workflows.
Problem

Research questions and friction points this paper is trying to address.

Unrealistic anomaly generation
Scarcity of anomalous samples
Logical anomaly detection in industry
Innovation

Methods, ideas, or system contributions that make the work stand out.

Component-aware unsupervised framework
Multi-component learning strategy
Attention-guided residual mapping
πŸ”Ž Similar Papers
No similar papers found.
X
Xuan Tong
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
Yang Chang
Yang Chang
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
Q
Qing Zhao
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
J
Jiawen Yu
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
B
Boyang Wang
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
Junxiong Lin
Junxiong Lin
Fudan University
Computer Vision
Yuxuan Lin
Yuxuan Lin
College of Computer Science and Artificial Intelligence, Fudan University
Computer VisionMultimodal LearningEmbodied AI
Xinji Mai
Xinji Mai
Fudan University
ReasoningAgent RL
H
Haoran Wang
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
Zeng Tao
Zeng Tao
Fudan University
Deep LearningComputer VisionGenerative Model3D VisionWorld Models
Y
Yan Wang
Shanghai Engineering Research Center of AI & Robotics, Academy for Engineering & Technology, Fudan University, Shanghai, China
W
Wenqiang Zhang
Engineering Research Center of AI & Robotics, Ministry of Education, Academy for Engineering & Technology, Fudan University, Shanghai, China