🤖 AI Summary
Visual industrial anomaly detection suffers from a severe academic-industry gap: mainstream benchmarks rely on controlled laboratory data, failing to reflect the diversity and operational constraints of real-world production lines—leading to over 40% average performance degradation for state-of-the-art methods in practice and prohibitively high deployment costs. Method: We introduce VIAD—the first benchmark explicitly designed for real-world manufacturing scenarios—and propose a deployment-centric, multidimensional evaluation framework covering inference latency, few-shot robustness, and zero-shot generalization, alongside a cross-dataset fair evaluation protocol. Contribution/Results: Through systematic analysis of the root causes underlying the academic-industry disconnect, we provide a reproducible bridging methodology. The VIAD benchmark and evaluation codebase are open-sourced and have been validated in industrial deployments across multiple manufacturing enterprises.
📝 Abstract
Anomaly detection (AD) is essential for automating visual inspection in manufacturing. This field of computer vision is rapidly evolving, with increasing attention towards real-world applications. Meanwhile, popular datasets are typically produced in controlled lab environments with artificially created defects, unable to capture the diversity of real production conditions. New methods often fail in production settings, showing significant performance degradation or requiring impractical computational resources. This disconnect between academic results and industrial viability threatens to misdirect visual anomaly detection research. This paper makes three key contributions: (1) we demonstrate the importance of real-world datasets and establish benchmarks using actual production data, (2) we provide a fair comparison of existing SOTA methods across diverse tasks by utilizing metrics that are valuable for practical applications, and (3) we present a comprehensive analysis of recent advancements in this field by discussing important challenges and new perspectives for bridging the academia-industry gap. The code is publicly available at https://github.com/abc-125/viad-benchmark