CostFilter-AD: Enhancing Anomaly Detection through Matching Cost Filtering

πŸ“… 2025-05-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing unsupervised anomaly detection (UAD) methods suffer from inaccurate image- or feature-level matching due to noise sensitivity, edge blurring, and failure to capture subtle anomalies, thereby limiting detection performance. To address this, we propose the first cost volume modeling framework for UAD, integrated with a multi-layer attention-guided 3D convolutional filtering networkβ€”a plug-and-play post-processing module. Our approach dynamically refines matching costs via cross-layer attention, jointly preserving structural fidelity and enhancing anomaly sensitivity. The lightweight, modular architecture seamlessly supports both reconstruction-based and embedding-based UAD paradigms. Extensive experiments on MVTec-AD and VisA benchmarks demonstrate significant improvements in both single-class and multi-class anomaly detection, achieving state-of-the-art performance. Ablation studies confirm the generalizability and robustness of our method across diverse anomaly types and domains. Code and pretrained models are publicly available.

Technology Category

Application Category

πŸ“ Abstract
Unsupervised anomaly detection (UAD) seeks to localize the anomaly mask of an input image with respect to normal samples. Either by reconstructing normal counterparts (reconstruction-based) or by learning an image feature embedding space (embedding-based), existing approaches fundamentally rely on image-level or feature-level matching to derive anomaly scores. Often, such a matching process is inaccurate yet overlooked, leading to sub-optimal detection. To address this issue, we introduce the concept of cost filtering, borrowed from classical matching tasks, such as depth and flow estimation, into the UAD problem. We call this approach {em CostFilter-AD}. Specifically, we first construct a matching cost volume between the input and normal samples, comprising two spatial dimensions and one matching dimension that encodes potential matches. To refine this, we propose a cost volume filtering network, guided by the input observation as an attention query across multiple feature layers, which effectively suppresses matching noise while preserving edge structures and capturing subtle anomalies. Designed as a generic post-processing plug-in, CostFilter-AD can be integrated with either reconstruction-based or embedding-based methods. Extensive experiments on MVTec-AD and VisA benchmarks validate the generic benefits of CostFilter-AD for both single- and multi-class UAD tasks. Code and models will be released at https://github.com/ZHE-SAPI/CostFilter-AD.
Problem

Research questions and friction points this paper is trying to address.

Improving anomaly detection via cost filtering in unsupervised methods
Reducing matching noise while preserving edge structures in images
Enhancing both reconstruction-based and embedding-based anomaly detection approaches
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces cost filtering for anomaly detection
Uses cost volume filtering network
Generic post-processing plug-in design
πŸ”Ž Similar Papers
No similar papers found.
Z
Zhe Zhang
State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang, China
M
Mingxiu Cai
State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang, China
Hanxiao Wang
Hanxiao Wang
CASIA
Computer Graphics3D Generation
G
Gaochang Wu
State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang, China
Tianyou Chai
Tianyou Chai
Northeastern University China
modelingcontroloptimizationintegrated automation of industrial processesadaptive control
Xiatian Zhu
Xiatian Zhu
University of Surrey
Machine LearningComputer Vision