Lossless Compression: A New Benchmark for Time Series Model Evaluation

📅 2025-09-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing time-series model evaluation focuses on downstream tasks—e.g., forecasting, imputation, anomaly detection, and classification—but lacks rigorous assessment of how well models capture the underlying data-generating distribution. Method: We propose lossless compression as a novel, theoretically grounded evaluation paradigm, leveraging Shannon’s source coding theorem to equate optimal compression length with negative log-likelihood, thereby establishing a unified information-theoretic benchmark. We introduce TSCom-Bench—a standardized protocol and open-source framework—that enables rapid adaptation of state-of-the-art models (e.g., TimeXer, iTransformer, PatchTST) as compression backbones. Contribution/Results: Experiments across diverse time-series datasets demonstrate that our approach effectively uncovers latent deficiencies in distribution modeling by leading models, significantly enhancing evaluation rigor and depth. TSCom-Bench constitutes the first general-purpose, full-distribution-oriented benchmark for time-series generative modeling.

Technology Category

Application Category

📝 Abstract
The evaluation of time series models has traditionally focused on four canonical tasks: forecasting, imputation, anomaly detection, and classification. While these tasks have driven significant progress, they primarily assess task-specific performance and do not rigorously measure whether a model captures the full generative distribution of the data. We introduce lossless compression as a new paradigm for evaluating time series models, grounded in Shannon's source coding theorem. This perspective establishes a direct equivalence between optimal compression length and the negative log-likelihood, providing a strict and unified information-theoretic criterion for modeling capacity. Then We define a standardized evaluation protocol and metrics. We further propose and open-source a comprehensive evaluation framework TSCom-Bench, which enables the rapid adaptation of time series models as backbones for lossless compression. Experiments across diverse datasets on state-of-the-art models, including TimeXer, iTransformer, and PatchTST, demonstrate that compression reveals distributional weaknesses overlooked by classic benchmarks. These findings position lossless compression as a principled task that complements and extends existing evaluation for time series modeling.
Problem

Research questions and friction points this paper is trying to address.

Evaluating time series models beyond traditional task-specific performance metrics
Measuring whether models capture the full generative distribution of data
Establishing a unified information-theoretic criterion for modeling capacity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lossless compression as evaluation paradigm
Equivalence between compression length and log-likelihood
TSCom-Bench framework for model compression adaptation
🔎 Similar Papers
No similar papers found.
M
Meng Wan
Computer Network Information Center, Chinese Academy of Sciences
B
Benxi Tian
North China Electric Power University
J
Jue Wang
Computer Network Information Center, Chinese Academy of Sciences
C
Cui Hui
Software Institute of Chinaunicom
N
Ningming Nie
Computer Network Information Center, Chinese Academy of Sciences
T
Tiantian Liu
Computer Network Information Center, Chinese Academy of Sciences
Z
Zongguo Wang
Computer Network Information Center, Chinese Academy of Sciences
C
Cao Rongqiang
Computer Network Information Center, Chinese Academy of Sciences
P
Peng Shi
University of Science and Technology Beijing
Yangang Wang
Yangang Wang
Professor, Southeast University
Computer graphicsComputer visionComputational photography