AutoSTF: Decoupled Neural Architecture Search for Cost-Effective Automated Spatio-Temporal Forecasting

📅 2024-09-25

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

176K/year

🤖 AI Summary

Existing neural architecture search (NAS) methods for spatiotemporal forecasting in smart cities suffer from high computational overhead and limited support for fine-grained spatiotemporal operator exploration. Method: We propose a decoupled NAS framework featuring: (i) a novel spatiotemporal decoupling search paradigm that separates spatial and temporal modeling; (ii) representation compression and parameter sharing to reduce search cost; (iii) multi-block transfer modules for multi-granularity temporal modeling; and (iv) a layer-level fine-grained spatial search space. The method integrates spatiotemporal graph neural networks with differentiable NAS optimization. Contribution/Results: Our approach achieves state-of-the-art accuracy on eight benchmark datasets while accelerating the search process by up to 13.48×. It significantly reduces computational cost without compromising predictive performance, enabling efficient and scalable architecture discovery for urban spatiotemporal forecasting.

Technology Category

Application Category

📝 Abstract

Spatio-temporal forecasting is a critical component of various smart city applications, such as transportation optimization, energy management, and socio-economic analysis. Recently, several automated spatio-temporal forecasting methods have been proposed to automatically search the optimal neural network architecture for capturing complex spatio-temporal dependencies. However, the existing automated approaches suffer from expensive neural architecture search overhead, which hinders their practical use and the further exploration of diverse spatio-temporal operators in a finer granularity. In this paper, we propose AutoSTF, a decoupled automatic neural architecture search framework for cost-effective automated spatio-temporal forecasting. From the efficiency perspective, we first decouple the mixed search space into temporal space and spatial space and respectively devise representation compression and parameter-sharing schemes to mitigate the parameter explosion. The decoupled spatio-temporal search not only expedites the model optimization process but also leaves new room for more effective spatio-temporal dependency modeling. From the effectiveness perspective, we propose a multi-patch transfer module to jointly capture multi-granularity temporal dependencies and extend the spatial search space to enable finer-grained layer-wise spatial dependency search. Extensive experiments on eight datasets demonstrate the superiority of AutoSTF in terms of both accuracy and efficiency. Specifically, our proposed method achieves up to 13.48x speed-up compared to state-of-the-art automatic spatio-temporal forecasting methods while maintaining the best forecasting accuracy.

Problem

Research questions and friction points this paper is trying to address.

Automated Model Optimization

Computational Resource Consumption

Spatio-Temporal Prediction

Innovation

Methods, ideas, or system contributions that make the work stand out.

AutoSTF

spatio-temporal prediction

model optimization

🔎 Similar Papers

Optimizing Time Series Forecasting Architectures: A Hierarchical Neural Architecture Search Approach