Intermittent time series forecasting: local vs global models

📅 2026-01-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Probabilistic forecasting of intermittent time series—characterized by a high frequency of zero values—is crucial yet challenging in supply chain inventory management. This study systematically compares local models (e.g., iETS, TweedieGP) with global neural network approaches (e.g., D-Linear, DeepAR, Transformer) and, for the first time, integrates hurdle-shifted negative binomial and Tweedie distributions as probabilistic output heads for neural networks. Extensive experiments on over 40,000 real-world sequences demonstrate that D-Linear consistently outperforms local models in both predictive accuracy and computational efficiency. Among the proposed distributional heads, the negative binomial formulation yields the best overall performance, while the Tweedie distribution excels specifically in high quantile forecasting.

Technology Category

Application Category

📝 Abstract
Intermittent time series, characterised by the presence of a significant amount of zeros, constitute a large percentage of inventory items in supply chain. Probabilistic forecasts are needed to plan the inventory levels; the predictive distribution should cover non-negative values, have a mass in zero and a long upper tail. Intermittent time series are commonly forecast using local models, which are trained individually on each time series. In the last years global models, which are trained on a large collection of time series, have become popular for time series forecasting. Global models are often based on neural networks. However, they have not yet been exhaustively tested on intermittent time series. We carry out the first study comparing state-of-the-art local (iETS, TweedieGP) and global models (D-Linear, DeepAR, Transformers) on intermittent time series. For neural networks models we consider three different distribution heads suitable for intermittent time series: negative binomial, hurdle-shifted negative binomial and Tweedie. We use, for the first time, the last two distribution heads with neural networks. We perform experiments on five large datasets comprising more than 40'000 real-world time series. Among neural networks D-Linear provides best accuracy; it also consistently outperforms the local models. Moreover, it has also low computational requirements. Transformers-based architectures are instead much more computationally demanding and less accurate. Among the distribution heads, the Tweedie provides the best estimates of the highest quantiles, while the negative binomial offers overall the best performance.
Problem

Research questions and friction points this paper is trying to address.

intermittent time series
probabilistic forecasting
inventory planning
zero-inflated data
time series forecasting
Innovation

Methods, ideas, or system contributions that make the work stand out.

intermittent time series
global models
distribution heads
D-Linear
probabilistic forecasting
🔎 Similar Papers
No similar papers found.
S
Stefano Damato
Scuola Universitaria Professionale della Svizzera Italiana (SUPSI), Istituto Dalle Molle di Studi sull’Intelligenza Artificiale (IDSIA), Lugano, Switzerland
N
Nicolò Rubattu
Scuola Universitaria Professionale della Svizzera Italiana (SUPSI), Istituto Dalle Molle di Studi sull’Intelligenza Artificiale (IDSIA), Lugano, Switzerland
Dario Azzimonti
Dario Azzimonti
IDSIA
Gaussian processesSet estimation
Giorgio Corani
Giorgio Corani
IDSIA (Istituto Dalle Molle di Studi sull'Intelligenza Artificiale)
probabilistic graphical modelsprobabilistic forecastingBayesian staticsMachine Learning