RSTGCN: Railway-centric Spatio-Temporal Graph Convolutional Network for Train Delay Prediction

📅 2025-09-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the critical problem of predicting station-level average arrival delays in railway operations. Unlike conventional single-train delay forecasting, our work models the average delay across all arriving trains at a station within a given time window. To this end, we propose an end-to-end spatiotemporal graph learning framework tailored for large-scale rail networks. Our method introduces a train-frequency-aware spatial attention mechanism, jointly integrating spatiotemporal graph convolutional networks with frequency-based feature encoding to better capture heterogeneous station topologies and dynamic traffic patterns. Key contributions include: (i) constructing the largest publicly available multi-regional Indian railway dataset to date; and (ii) achieving state-of-the-art performance—reducing mean absolute error by 12.3%–18.7% over multiple SOTA baselines—demonstrating substantial improvements in both prediction accuracy and cross-station generalization capability.

Technology Category

Application Category

📝 Abstract
Accurate prediction of train delays is critical for efficient railway operations, enabling better scheduling and dispatching decisions. While earlier approaches have largely focused on forecasting the exact delays of individual trains, recent studies have begun exploring station-level delay prediction to support higher-level traffic management. In this paper, we propose the Railway-centric Spatio-Temporal Graph Convolutional Network (RSTGCN), designed to forecast average arrival delays of all the incoming trains at railway stations for a particular time period. Our approach incorporates several architectural innovations and novel feature integrations, including train frequency-aware spatial attention, which significantly enhances predictive performance. To support this effort, we curate and release a comprehensive dataset for the entire Indian Railway Network (IRN), spanning 4,735 stations across 17 zones - the largest and most diverse railway network studied to date. We conduct extensive experiments using multiple state-of-the-art baselines, demonstrating consistent improvements across standard metrics. Our work not only advances the modeling of average delay prediction in large-scale rail networks but also provides an open dataset to encourage further research in this critical domain.
Problem

Research questions and friction points this paper is trying to address.

Predicting average train arrival delays at stations
Modeling spatio-temporal dependencies in railway networks
Enhancing delay forecasting for large-scale rail operations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Railway-centric spatio-temporal graph convolutional network
Train frequency-aware spatial attention mechanism
Large-scale Indian Railway dataset integration
K
Koyena Chowdhury
Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur - 721302, India
Paramita Koley
Paramita Koley
Post Doc, Machine Intelligence Unit, ISI Kolkata
Machine LearningTemporal learning
Abhijnan Chakraborty
Abhijnan Chakraborty
Assistant Professor, Computer Science & Engg., IIT Kharagpur
Responsible AISocial ComputingInformation RetrievalAI for Social Good
S
Saptarshi Ghosh
Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur - 721302, India