Spatial PDE-aware Selective State-space with Nested Memory for Mobile Traffic Grid Forecasting

๐Ÿ“… 2026-03-12
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenges of strong spatiotemporal dependencies, high spatial heterogeneity, and substantial computational overhead in large-scale cellular network mobile traffic grid forecasting by proposing the NeST-S6 model. NeST-S6 integrates convolutional local mixing, a partial differential equation (PDE)-aware selective state space architecture, and a nested learning optimizer-driven long-term memory mechanism to effectively capture heterogeneous dynamics and adapt to distribution shifts. Evaluated on the Milan dataset, NeST-S6 significantly outperforms Mamba-based baselines, achieving a 32ร— acceleration in full-grid reconstruction, a 4.3ร— reduction in MACs, a 61% decrease in per-pixel RMSE, and a 48%โ€“65% improvement in MAE under distribution shift scenarios.

Technology Category

Application Category

๐Ÿ“ Abstract
Traffic forecasting in cellular networks is a challenging spatiotemporal prediction problem due to strong temporal dependencies, spatial heterogeneity across cells, and the need for scalability to large network deployments. Traditional cell-specific models incur prohibitive training and maintenance costs, while global models often fail to capture heterogeneous spatial dynamics. Recent spatiotemporal architectures based on attention or graph neural networks improve accuracy but introduce high computational overhead, limiting their applicability in large-scale or real-time settings. We study spatiotemporal grid forecasting, where each time step is a 2D lattice of traffic values, and predict the next grid patch using previous patches. We propose NeST-S6, a convolutional selective state-space model (SSM) with a spatial PDE-aware core, implemented in a nested learning paradigm: convolutional local spatial mixing feeds a spatial PDE-aware SSM core, while a nested-learning long-term memory is updated by a learned optimizer when one-step prediction errors indicate unmodeled dynamics. On the mobile-traffic grid (Milan dataset) at three resolutions (202, 502, 1002), NeST-S6 attains lower errors than a strong Mamba-family baseline in both single-step and 6-step autoregressive rollouts. Under drift stress tests, our model's nested memory lowers MAE by 48-65% over a no-memory ablation. NeST-S6 also speeds full-grid reconstruction by 32 times and reduces MACs by 4.3 times compared to competitive per-pixel scanning models, while achieving 61% lower per-pixel RMSE.
Problem

Research questions and friction points this paper is trying to address.

traffic forecasting
spatiotemporal prediction
spatial heterogeneity
scalability
mobile networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Selective State-Space Model
Spatial PDE-aware
Nested Memory
Spatiotemporal Forecasting
Mobile Traffic Grid
๐Ÿ”Ž Similar Papers
No similar papers found.