Learning local and global prototypes with optimal transport for unsupervised anomaly detection and localization

πŸ“… 2025-08-18
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This paper addresses unsupervised image anomaly detection and localization using only normal samples. Methodologically, it leverages a pre-trained encoder to extract features and jointly learns local and global prototypes; structured embedding representations are constructed via optimal transport, jointly encoding feature similarity and spatial layout. Prototype optimization explicitly models the intrinsic geometric structure of normal data, enhancing sensitivity to local inconsistencies. The key contribution is the integration of optimal transport with prototype learning, enabling structural-aware anomaly scoring through a combined feature-spatial cost. Evaluated on the MVTec AD and VisA industrial benchmarks, the method achieves state-of-the-art performance in both image-level detection and pixel-level localization, demonstrating the effectiveness and generalizability of structured representation learning.

Technology Category

Application Category

πŸ“ Abstract
Unsupervised anomaly detection aims to detect defective parts of a sample by having access, during training, to a set of normal, i.e. defect-free, data. It has many applications in fields, such as industrial inspection or medical imaging, where acquiring labels is costly or when we want to avoid introducing biases in the type of anomalies that can be spotted. In this work, we propose a novel UAD method based on prototype learning and introduce a metric to compare a structured set of embeddings that balances a feature-based cost and a spatial-based cost. We leverage this metric to learn local and global prototypes with optimal transport from latent representations extracted with a pre-trained image encoder. We demonstrate that our approach can enforce a structural constraint when learning the prototypes, allowing to capture the underlying organization of the normal samples, thus improving the detection of incoherencies in images. Our model achieves performance that is on par with strong baselines on two reference benchmarks for anomaly detection on industrial images. The code is available at https://github.com/robintrmbtt/pradot.
Problem

Research questions and friction points this paper is trying to address.

Detect defects in samples using only normal training data
Balance feature and spatial costs for anomaly detection
Learn structured prototypes to improve anomaly localization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Learning local and global prototypes
Optimal transport for anomaly detection
Balancing feature and spatial costs
πŸ”Ž Similar Papers
No similar papers found.
R
Robin Trombetta
Univ. Lyon, CNRS UMR 5220, Inserm U1294, INSA Lyon, UCBL, CREATIS, France
Carole Lartizien
Carole Lartizien
laboratoire CREATIS, CNRS, INSA, UniversitΓ© de Lyon
medical image analysismachine learning