$spadesuit$ SPADE $spadesuit$ Split Peak Attention DEcomposition

πŸ“… 2024-11-06
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address demand forecasting overshoot during peak events (e.g., promotions, holidays) and subsequent post-peak prediction bias, this paper proposes a decoupled forecasting framework that decomposes demand into two subtasks: peak-event and non-peak-event modeling. We introduce a novel hybrid architecture integrating masked convolution and peak attention: the masked convolution filters out peak-induced noise by suppressing interference outside temporally bounded event windows, while the peak attention module dynamically captures the evolving temporal influence of events. A dual-path temporal decomposition further enhances feature disentanglement. Evaluated on a global retail dataset comprising billions of SKUs, our method reduces post-peak forecasting error by 4.5% and improves peak-period accuracy by 3.9%, significantly outperforming state-of-the-art end-to-end time-series models.

Technology Category

Application Category

πŸ“ Abstract
Demand forecasting faces challenges induced by Peak Events (PEs) corresponding to special periods such as promotions and holidays. Peak events create significant spikes in demand followed by demand ramp down periods. Neural networks like MQCNN and MQT overreact to demand peaks by carrying over the elevated PE demand into subsequent Post-Peak-Event (PPE) periods, resulting in significantly over-biased forecasts. To tackle this challenge, we introduce a neural forecasting model called Split Peak Attention DEcomposition, SPADE. This model reduces the impact of PEs on subsequent forecasts by modeling forecasting as consisting of two separate tasks: one for PEs; and the other for the rest. Its architecture then uses masked convolution filters and a specialized Peak Attention module. We show SPADE's performance on a worldwide retail dataset with hundreds of millions of products. Our results reveal a reduction in PPE degradation by 4.5% and an improvement in PE accuracy by 3.9%, relative to current production models.
Problem

Research questions and friction points this paper is trying to address.

Demand Forecasting
Seasonal Variability
Prediction Accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

SPADE model
Split Peak Attention
demand peak handling
πŸ”Ž Similar Papers
M
Malcolm Wolff
Amazon
Kin G. Olivares
Kin G. Olivares
Carnegie Mellon University
machine learningstatisticsforecasting
Boris N. Oreshkin
Boris N. Oreshkin
Principal Scientist at Amazon
Machine Learning
S
Sunny Ruan
Amazon
S
Sitan Yang
Amazon
A
Abhinav Katoch
Amazon
S
Shankar Ramasubramanian
Amazon
Y
Youxin Zhang
Amazon
M
Michael W. Mahoney
Amazon
Dmitry Efimov
Dmitry Efimov
Amazon
Machine LearningData ScienceFeature selectionBioinformatics
V
Vincent Quenneville-BΓ©lair
Amazon