🤖 AI Summary
This study addresses the limited generalization capability of existing anomaly detection methods when applied to highly complex, multi-stage industrial time-series data, where capturing aperiodic and multi-scale dynamic patterns remains challenging. For the first time, the authors systematically evaluate a range of unsupervised models—including Isolation Forest, recurrent autoencoders, variational autoencoders, and temporal convolutional autoencoders (TCAE)—in real-world, high-complexity industrial settings. Their experiments reveal significant limitations of classical approaches while demonstrating the superior effectiveness of deep autoencoder architectures. Among these, TCAE exhibits the most robust performance, substantially outperforming competing methods and offering a reliable solution for anomaly detection in intricate industrial processes.
📝 Abstract
Industrial time-series data from real production environments exhibits substantially higher complexity than commonly used benchmark datasets, primarily due to heterogeneous, multi-stage operational processes. As a result, anomaly detection methods validated under simplified conditions often fail to generalize to industrial settings. This work presents an empirical study on a unique dataset collected from fully operational industrial machinery, explicitly capturing pronounced process-induced variability.
We evaluate which model classes are capable of capturing this complexity, starting with a classical Isolation Forest baseline and extending to multiple autoencoder architectures. Experimental results show that Isolation Forest is insufficient for modeling the non-periodic, multi-scale dynamics present in the data, whereas autoencoders consistently perform better. Among them, temporal convolutional autoencoders achieve the most robust performance, while recurrent and variational variants require more careful tuning.