🤖 AI Summary
This study investigates the heterogeneity in travel behavior between high-frequency (HF, top 25%) and low-frequency (LF, bottom 75%) bus passengers in urban transit systems, and its implications for network structure, robustness, and efficiency. Leveraging 20 million Beijing bus smart-card transactions, we construct and comparatively analyze two distinct passenger-based complex networks for the first time. Methodologically, we integrate large-scale data cleaning, K-means clustering, multidimensional network metrics modeling (e.g., clustering coefficient, degree centrality, average path length), spatiotemporal pattern mining, and cascading failure simulations. Results reveal that the HF network exhibits high clustering (C = 0.72) but low robustness—efficiency drops by 35% under targeted attacks—whereas the LF network is more dispersed yet resilient, with only a 10% efficiency decline. Moreover, 57.4% of HF trips concentrate in the morning peak. Based on these findings, we propose group-specific strategies to enhance system resilience and equitable accessibility, offering quantitative foundations for precision-oriented urban transport governance.
📝 Abstract
This study investigates the network characteristics of high-frequency (HF) and low-frequency (LF) travelers in urban public transport systems by analyzing 20 million smart card records from Beijing's transit network. A novel methodology integrates advanced data preprocessing, clustering techniques, and complex network analysis to differentiate HF and LF passenger behaviors and their impacts on network structure, robustness, and efficiency. The primary challenge is accurately segmenting and modeling the behaviors of diverse passenger groups within a large-scale, noisy dataset while maintaining computational efficiency and scalability. HF networks, representing the top 25% of travelers by usage frequency, exhibit high connectivity with an average clustering coefficient of 0.72 and greater node degree centrality. However, they have lower robustness, with efficiency declining by 35% under targeted disruptions and longer average path lengths of 6.2 during peak hours. In contrast, LF networks, which include 75% of travelers, are more dispersed yet resilient, with efficiency declining by only 10% under similar disruptions and stronger intracommunity connectivity. Temporal analysis reveals that HF passengers significantly contribute to peak-hour congestion, with 57.4% of HF trips occurring between 6:00 and 10:00 AM, while LF passengers show a broader temporal distribution, helping to mitigate congestion hotspots. Understanding these travel patterns is crucial for optimizing public transit systems. The findings suggest targeted strategies such as enhancing robustness in HF networks by diversifying key routes and improving accessibility in LF-dominated areas. This research provides a scalable framework for analyzing smart card data and offers actionable insights for optimizing transit networks, improving congestion management, and advancing sustainable urban mobility planning.