Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis

📅 2023-10-31
🏛️ Neural Information Processing Systems
📈 Citations: 12
Influential: 0
📄 PDF
🤖 AI Summary
To address data scarcity and inconsistent performance in road network traffic accident prediction, this paper introduces the first large-scale, publicly available U.S. multi-state unified road safety dataset—comprising 9 million crash records—and integrates road network topology with traffic flow features. We propose a Graph Neural Network (GraphSAGE)-based multi-task transfer learning framework that jointly models traffic volume and crash risk while mitigating inter-state label distribution shifts. Experiments show an average absolute error <22% for crash count prediction and an AUROC of 87.3% for crash occurrence classification; ablation studies confirm the substantial contribution of graph-structured features. Our key contributions are: (1) releasing the first standardized, reproducible large-scale road crash benchmark dataset; and (2) designing a multi-task transfer graph learning paradigm explicitly tailored to geographic heterogeneity.
📝 Abstract
We consider the problem of traffic accident analysis on a road network based on road network connections and traffic volume. Previous works have designed various deep-learning methods using historical records to predict traffic accident occurrences. However, there is a lack of consensus on how accurate existing methods are, and a fundamental issue is the lack of public accident datasets for comprehensive evaluations. This paper constructs a large-scale, unified dataset of traffic accident records from official reports of various states in the US, totaling 9 million records, accompanied by road networks and traffic volume reports. Using this new dataset, we evaluate existing deep-learning methods for predicting the occurrence of accidents on road networks. Our main finding is that graph neural networks such as GraphSAGE can accurately predict the number of accidents on roads with less than 22% mean absolute error (relative to the actual count) and whether an accident will occur or not with over 87% AUROC, averaged over states. We achieve these results by using multitask learning to account for cross-state variabilities (e.g., availability of accident labels) and transfer learning to combine traffic volume with accident prediction. Ablation studies highlight the importance of road graph-structural features, amongst other features. Lastly, we discuss the implications of the analysis and develop a package for easily using our new dataset.
Problem

Research questions and friction points this paper is trying to address.

Lack of public accident datasets for comprehensive road safety evaluations
Uncertain accuracy of existing deep-learning methods for accident prediction
Need to analyze traffic accidents using road networks and traffic volume
Innovation

Methods, ideas, or system contributions that make the work stand out.

Constructed large-scale unified US accident dataset
Applied GraphSAGE neural networks for prediction
Used multitask and transfer learning techniques
🔎 Similar Papers
No similar papers found.
A
Abhinav Nippani
Northeastern University, Boston
D
Dongyue Li
Northeastern University, Boston
H
Haotian Ju
Northeastern University, Boston
H
Haris N. Koutsopoulos
Northeastern University, Boston
H
Hongyang R. Zhang
Northeastern University, Boston