An Experimental Evaluation of Imputation Models for Spatial-Temporal Traffic Data

📅 2024-12-06
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traffic data imputation faces three major challenges: complex missingness patterns, inconsistent benchmarks, and overly narrow evaluation criteria. To address these, this work introduces the first spatiotemporal joint missingness pattern taxonomy and establishes a standardized evaluation framework encompassing ten representative model families—including graph neural networks (GNNs), recurrent neural networks (RNNs), Transformers, and matrix factorization methods. We conduct systematic assessments across synthetic and real-world traffic datasets, measuring effectiveness, computational efficiency, and robustness. Through controlled experiments varying both missing rates and patterns, we empirically characterize the performance boundaries of each model class under diverse missingness scenarios—marking the first such comprehensive analysis. Our findings yield a reproducible, comparable model selection guideline for traffic imputation and advance the field toward standardization and practical deployment.

Technology Category

Application Category

📝 Abstract
Traffic data imputation is a critical preprocessing step in intelligent transportation systems, enabling advanced transportation services. Despite significant advancements in this field, selecting the most suitable model for practical applications remains challenging due to three key issues: 1) incomprehensive consideration of missing patterns that describe how data loss along spatial and temporal dimensions, 2) the lack of test on standardized datasets, and 3) insufficient evaluations. To this end, we first propose practice-oriented taxonomies for missing patterns and imputation models, systematically identifying all possible forms of real-world traffic data loss and analyzing the characteristics of existing models. Furthermore, we introduce a unified benchmarking pipeline to comprehensively evaluate 10 representative models across various missing patterns and rates. This work aims to provide a holistic understanding of traffic data imputation research and serve as a practical guideline.
Problem

Research questions and friction points this paper is trying to address.

Lack of taxonomy for traffic data imputation models
Absence of unified benchmarking pipeline for evaluation
Insufficient multidimensional analysis of model performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposed practice-oriented taxonomies for missing patterns
Introduced unified benchmarking pipeline for model evaluation
Comprehensively assessed 11 models across multiple dimensions
🔎 Similar Papers
No similar papers found.
S
S. Guo
School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China
Tonglong Wei
Tonglong Wei
Beijing jiaotong university
data mining
Yiheng Huang
Yiheng Huang
Fudan University
Software Supply Chain
M
Miaomiao Zhao
School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China
R
Ran Chen
School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China
Y
Yan Lin
Department of Computer Science, Aalborg University, Aalborg, Denmark
Y
Youfang Lin
School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China
H
Huaiyu Wan
School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China