SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction

📅 2025-11-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing spatiotemporal forecasting models incur prohibitive computational overhead and suffer from severe data and feature redundancy on resource-constrained edge devices. Method: This paper proposes the first efficient spatiotemporal modeling framework explicitly leveraging data sparsity. It introduces a sparsely activated ConvLSTM architecture, a data-sparsity-aware training strategy, and a redundant-feature filtering mechanism; further, it employs a multi-objective composite loss function to jointly optimize accuracy and efficiency, enabling customizable Pareto-optimal trade-offs. Contribution/Results: It is the first work to explicitly incorporate data sparsity modeling into the spatiotemporal forecasting pipeline. Evaluated on real-world edge tasks—including traffic flow and medical time-series prediction—the framework achieves an average 2.3× inference speedup and 47% memory reduction over baseline models, while matching state-of-the-art accuracy. This establishes a novel paradigm for lightweight spatiotemporal intelligence.

Technology Category

Application Category

📝 Abstract
Spatiotemporal data mining (STDM) has a wide range of applications in various complex physical systems (CPS), i.e., transportation, manufacturing, healthcare, etc. Among all the proposed methods, the Convolutional Long Short-Term Memory (ConvLSTM) has proved to be generalizable and extendable in different applications and has multiple variants achieving state-of-the-art performance in various STDM applications. However, ConvLSTM and its variants are computationally expensive, which makes them inapplicable in edge devices with limited computational resources. With the emerging need for edge computing in CPS, efficient AI is essential to reduce the computational cost while preserving the model performance. Common methods of efficient AI are developed to reduce redundancy in model capacity (i.e., model pruning, compression, etc.). However, spatiotemporal data mining naturally requires extensive model capacity, as the embedded dependencies in spatiotemporal data are complex and hard to capture, which limits the model redundancy. Instead, there is a fairly high level of data and feature redundancy that introduces an unnecessary computational burden, which has been largely overlooked in existing research. Therefore, we developed a novel framework SparseST, that pioneered in exploiting data sparsity to develop an efficient spatiotemporal model. In addition, we explore and approximate the Pareto front between model performance and computational efficiency by designing a multi-objective composite loss function, which provides a practical guide for practitioners to adjust the model according to computational resource constraints and the performance requirements of downstream tasks.
Problem

Research questions and friction points this paper is trying to address.

Reducing computational cost of spatiotemporal models for edge devices
Addressing data and feature redundancy in spatiotemporal data mining
Balancing model performance with computational efficiency constraints
Innovation

Methods, ideas, or system contributions that make the work stand out.

Exploiting data sparsity for efficient spatiotemporal modeling
Designing multi-objective loss for performance-efficiency tradeoff
Reducing computational burden through feature redundancy elimination
Junfeng Wu
Junfeng Wu
Huazhong University of Science and Technology
Computer Vision
Hadjer Benmeziane
Hadjer Benmeziane
Research Scientist @ IBM Research
Efficient Deep LearningAutoMLNAS
K
K. E. Maghraoui
IBM Research
L
Liu Liu
Department of Electrical, Computer, and Systems Engineering and the Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180
Y
Yinan Wang
Department of Industrial and Systems Engineering, Rensselaer Polytechnic Institute, Troy, NY, 12180