π€ AI Summary
This work addresses anomaly detection on low-dimensional manifold-structured data. We propose an efficient isolation-based method that embeds data into a high-dimensional semantic-enhanced preference space and employs Locality-Sensitive Hashing (LSH) to accelerate sparse neighborhood estimation, thereby identifying the most isolated samples as anomalies. To our knowledge, this is the first approach to integrate LSH into a preference-space isolation framework, achieving both theoretical soundness and computational efficiency. Extensive experiments on multiple benchmark datasets demonstrate state-of-the-art detection performance, with inference speed improved by 3β5Γ over existing methods, alongside substantial reductions in time and memory overhead. The source code is publicly available.
π Abstract
We focus on the problem of identifying samples in a set that do not conform to structured patterns represented by low-dimensional manifolds. An effective way to solve this problem is to embed data in a high dimensional space, called Preference Space, where anomalies can be identified as the most isolated points. In this work, we employ Locality Sensitive Hashing to avoid explicit computation of distances in high dimensions and thus improve Anomaly Detection efficiency. Specifically, we present an isolation-based anomaly detection technique designed to work in the Preference Space which achieves state-of-the-art performance at a lower computational cost. Code is publicly available at https://github.com/ineveLoppiliF/Hashing-for-Structure-based-Anomaly-Detection.