LLM-Enhanced Reinforcement Learning for Time Series Anomaly Detection

📅 2026-01-05

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

179K/year

🤖 AI Summary

This work addresses the challenges of sparse annotations, complex temporal patterns, and high labeling costs in time series anomaly detection by proposing a novel approach that integrates semantic priors from large language models (LLMs) with reinforcement learning. The method leverages LLM-generated semantic rewards to guide an LSTM-based policy exploration, while reconstruction errors from a variational autoencoder (VAE) provide unsupervised anomaly signals. To efficiently expand limited labeled data, it incorporates active learning and label propagation mechanisms. Innovatively, semantic knowledge from the LLM is employed for reward shaping, complemented by a VAE-driven dynamic reward scaling strategy. Evaluated on the Yahoo-A1 and SMD benchmarks, the proposed method achieves state-of-the-art detection accuracy under extremely low annotation budgets, significantly outperforming existing techniques.

Technology Category

Application Category

📝 Abstract

Detecting anomalies in time series data is crucial for finance, healthcare, sensor networks, and industrial monitoring applications. However, time series anomaly detection often suffers from sparse labels, complex temporal patterns, and costly expert annotation. We propose a unified framework that integrates Large Language Model (LLM)-based potential functions for reward shaping with Reinforcement Learning (RL), Variational Autoencoder (VAE)-enhanced dynamic reward scaling, and active learning with label propagation. An LSTM-based RL agent leverages LLM-derived semantic rewards to guide exploration, while VAE reconstruction errors add unsupervised anomaly signals. Active learning selects the most uncertain samples, and label propagation efficiently expands labeled data. Evaluations on Yahoo-A1 and SMD benchmarks demonstrate that our method achieves state-of-the-art detection accuracy under limited labeling budgets and operates effectively in data-constrained settings. This study highlights the promise of combining LLMs with RL and advanced unsupervised techniques for robust, scalable anomaly detection in real-world applications.

Problem

Research questions and friction points this paper is trying to address.

time series anomaly detection

sparse labels

complex temporal patterns

expert annotation cost

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-enhanced reinforcement learning

reward shaping

VAE-based anomaly detection

active learning