🤖 AI Summary
Existing open-set time series anomaly detection methods struggle to preserve temporal structure when generating unseen anomalies and are highly sensitive to training data contaminated with unlabeled anomalies. This work introduces influence functions to this task for the first time, leveraging them to quantify the impact of individual training samples on the model. By identifying and removing high-influence anomalous samples, the approach effectively decontaminates the training data. The cleaned data is then used to synthesize semantically diverse yet temporally coherent unseen anomalies. This unified framework simultaneously achieves high-quality anomaly generation and robust model training, significantly outperforming state-of-the-art methods across various open-set settings and contamination levels, thereby substantially improving detection accuracy.
📝 Abstract
Open-set anomaly detection (OSAD) is an emerging paradigm designed to utilize limited labeled data from anomaly classes seen in training to identify both seen and unseen anomalies during testing. Current approaches rely on simple augmentation methods to generate pseudo anomalies that replicate unseen anomalies. Despite being promising in image data, these methods are found to be ineffective in time series data due to the failure to preserve its sequential nature, resulting in trivial or unrealistic anomaly patterns. They are further plagued when the training data is contaminated with unlabeled anomalies. This work introduces $\textbf{IMPACT}$, a novel framework that leverages $\underline{\textbf{i}}$nfluence $\underline{\textbf{m}}$odeling for o$\underline{\textbf{p}}$en-set time series $\underline{\textbf{a}}$nomaly dete$\underline{\textbf{ct}}$ion, to tackle these challenges. The key insight is to $\textbf{i)}$ learn an influence function that can accurately estimate the impact of individual training samples on the modeling, and then $\textbf{ii)}$ leverage these influence scores to generate semantically divergent yet realistic unseen anomalies for time series while repurposing high-influential samples as supervised anomalies for anomaly decontamination. Extensive experiments show that IMPACT significantly outperforms existing state-of-the-art methods, showing superior accuracy under varying OSAD settings and contamination rates.