Deep Time Series Models: A Comprehensive Survey and Benchmark

📅 2024-07-18
🏛️ arXiv.org
📈 Citations: 105
Influential: 4
📄 PDF
🤖 AI Summary
Deep learning for time series lacks standardized evaluation benchmarks and systematic architectural analysis. Method: We introduce TSLib, the first standardized deep time-series benchmark library, encompassing 24 state-of-the-art models, 30 cross-domain datasets, and five core tasks. It decouples modeling paradigms along two dimensions—fundamental modules and holistic architectures—to establish a modular, reproducible, and fair evaluation framework. Contribution/Results: Through extensive experiments, we uncover, for the first time, strong structural-task alignment patterns—i.e., specific architectural characteristics (e.g., attention mechanisms, convolutional depth, or recurrence design) exhibit consistent performance advantages on particular task types (e.g., forecasting, classification, anomaly detection). This finding provides empirical guidance for model selection and architecture design. Comprehensive evaluation across 12 advanced models validates the robustness of this insight. All code, data, and evaluation pipelines are publicly released and have garnered significant attention from the research community.

Technology Category

Application Category

📝 Abstract
Time series, characterized by a sequence of data points arranged in a discrete-time order, are ubiquitous in real-world applications. Different from other modalities, time series present unique challenges due to their complex and dynamic nature, including the entanglement of nonlinear patterns and time-variant trends. Analyzing time series data is of great significance in real-world scenarios and has been widely studied over centuries. Recent years have witnessed remarkable breakthroughs in the time series community, with techniques shifting from traditional statistical methods to advanced deep learning models. In this paper, we delve into the design of deep time series models across various analysis tasks and review the existing literature from two perspectives: basic modules and model architectures. Further, we develop and release Time Series Library (TSLib) as a fair benchmark of deep time series models for diverse analysis tasks, which implements 24 mainstream models, covers 30 datasets from different domains, and supports five prevalent analysis tasks. Based on TSLib, we thoroughly evaluate 12 advanced deep time series models on different tasks. Empirical results indicate that models with specific structures are well-suited for distinct analytical tasks, which offers insights for research and adoption of deep time series models. Code is available at https://github.com/thuml/Time-Series-Library.
Problem

Research questions and friction points this paper is trying to address.

Surveying deep learning approaches for time series analysis
Benchmarking model performance across diverse analytical tasks
Developing comprehensive library for time series model evaluation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Developed comprehensive benchmark for deep time series models
Implemented 30 models across 30 multi-domain datasets
Evaluated model-task suitability through empirical analysis
Y
Yuxuan Wang
School of Software, BNRist, Tsinghua University, Beijing 100084, China
Haixu Wu
Haixu Wu
Ph.D. student, Tsinghua University
Deep learningScientific Machine Learning
Jiaxiang Dong
Jiaxiang Dong
School of Software, BNRist, Tsinghua University, Beijing 100084, China
Y
Yong Liu
School of Software, BNRist, Tsinghua University, Beijing 100084, China
Mingsheng Long
Mingsheng Long
Associate Professor, Tsinghua University
Machine learningdeep learningtransfer learningscientific machine learning
J
Jianmin Wang
School of Software, BNRist, Tsinghua University, Beijing 100084, China