CFLight: Enhancing Safety with Traffic Signal Control through Counterfactual Learning

📅 2025-12-10
🤖 AI Summary
To address the prevalent efficiency-over-safety bias and limited interpretability of reinforcement learning (RL) methods in traffic signal control, this paper proposes a causal RL framework integrated with counterfactual reasoning. Methodologically, we (1) construct a counterfactual module grounded in a structural causal model (SCM) to quantify the potential impact of signal actions on crash risk; (2) design an interpretable “X-module” embedded within the policy network to explicitly encode safety constraints; and (3) introduce a near-zero-collision control paradigm. Evaluated on SUMO simulations and real-world traffic data, our approach reduces collision rates by over 35% compared to state-of-the-art RL and safe RL baselines, while simultaneously improving traffic throughput. The code and datasets are publicly released, and the framework demonstrates strong cross-scenario generalizability.

📝 Abstract
Traffic accidents cause millions of injuries and fatalities worldwide each year, and a significant share occur at intersections. Traffic Signal Control (TSC) is an effective strategy for enhancing safety at these urban junctions. Although Reinforcement Learning (RL) methods are increasingly popular for optimizing TSC, they often prioritize driving efficiency over safety and thus fail to balance the two. These methods also tend to lack interpretability. CounterFactual (CF) learning is a promising approach in many areas of causal analysis. In this study, we introduce a novel framework to improve the safety of RL-based TSC. The framework uses a CF learning method to address the question: "What if, when an unsafe event occurs, we backtrack and perform alternative actions? Will the unsafe event still occur in the subsequent period?" To answer this question, we propose a new structural causal model to predict the outcome of executing different actions, and a new CF module that integrates with additional "X" modules to promote safe RL practices. Our new algorithm, CFLight, derived from this framework, effectively tackles challenging safety events and significantly improves safety at intersections through a near-zero collision control strategy. Through extensive numerical experiments on both real-world and synthetic datasets, we demonstrate that CFLight reduces collisions and improves overall traffic performance compared to conventional RL methods and a recent safe RL model. Moreover, our method represents a generalized and safe framework for RL methods, opening possibilities for applications in other domains. The data and code are available on GitHub at https://github.com/MJLee00/CFLight-Enhancing-Safety-with-Traffic-Signal-Control-through-Counterfactual-Learning.
Problem

Research questions and friction points this paper is trying to address.

Enhances traffic signal control safety using counterfactual learning
Addresses RL methods prioritizing efficiency over intersection safety
Reduces collisions with near-zero accident control strategy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses counterfactual learning to enhance safety in traffic signal control
Integrates causal models with reinforcement learning for safer intersections
Implements near-zero collision strategy through backtracking alternative actions
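
The backtracking idea in the list above can be illustrated with a small toy sketch. Everything in it is an assumption made for illustration: the two-phase intersection, the `State` fields, the `QUEUE_LIMIT` threshold used as a stand-in for an unsafe event, and the function names are hypothetical and are not taken from the CFLight code or its structural causal model. The transition is deterministic, so replaying a trajectory with one action swapped keeps everything else fixed, which is the simplest form of a counterfactual query.

```python
from dataclasses import dataclass
from typing import List, Tuple

ACTIONS = ("NS_GREEN", "EW_GREEN")
QUEUE_LIMIT = 4  # assumed threshold: a longer queue counts as an "unsafe event"


@dataclass(frozen=True)
class State:
    ns_queue: int  # vehicles waiting on the north-south approach
    ew_queue: int  # vehicles waiting on the east-west approach


def step(state: State, action: str) -> Tuple[State, bool]:
    """Toy transition: the green approach discharges 2 vehicles, the red one gains 1."""
    if action == "NS_GREEN":
        nxt = State(max(state.ns_queue - 2, 0), state.ew_queue + 1)
    else:
        nxt = State(state.ns_queue + 1, max(state.ew_queue - 2, 0))
    unsafe = max(nxt.ns_queue, nxt.ew_queue) > QUEUE_LIMIT
    return nxt, unsafe


def rollout(start: State, actions: List[str]) -> Tuple[List[State], int]:
    """Return the visited states and the index of the first unsafe step (-1 if none)."""
    states = [start]
    first_unsafe = -1
    for t, action in enumerate(actions):
        nxt, unsafe = step(states[-1], action)
        states.append(nxt)
        if unsafe and first_unsafe < 0:
            first_unsafe = t
    return states, first_unsafe


def counterfactual_alternatives(
    start: State, actions: List[str], backtrack: int = 1
) -> List[Tuple[int, str]]:
    """List (step, alternative action) swaps that avoid every unsafe event.

    Only the `backtrack` steps ending at the first unsafe step are considered;
    all later logged actions are replayed unchanged.
    """
    states, t_unsafe = rollout(start, actions)
    if t_unsafe < 0:
        return []  # nothing unsafe happened, so there is nothing to explain
    safe_swaps = []
    for t in range(max(t_unsafe - backtrack + 1, 0), t_unsafe + 1):
        for alt in ACTIONS:
            if alt == actions[t]:
                continue
            # Backtrack to the state before step t and replay with one action swapped.
            cf_actions = [alt] + actions[t + 1:]
            _, cf_unsafe = rollout(states[t], cf_actions)
            if cf_unsafe < 0:
                safe_swaps.append((t, alt))
    return safe_swaps


if __name__ == "__main__":
    logged = ["NS_GREEN", "NS_GREEN", "NS_GREEN"]
    print(counterfactual_alternatives(State(3, 3), logged, backtrack=2))
```

In a learning setting, swaps found this way could plausibly serve as extra training signal that steers the policy away from the logged unsafe action; how CFLight actually uses its counterfactual module is described in the paper and repository, not in this sketch.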
Mingyuan Li
Lanzhou University, Beijing University of Posts and Telecommunications, Beijing, China
Chunyu Liu
Beijing University of Posts and Telecommunications, Beijing, China
Zhuojun Li
Tsinghua University
Human Computer Interaction
Xiao Liu
Beijing University of Posts and Telecommunications, Beijing, China
Guangsheng Yu
University of Technology Sydney
Security and Privacy of Machine Learning, Federated Learning, Distributed Learning, Web3, Blockchain
Bo Du
Department of Management, Griffith Business School
Sustainable Transport, Travel Behaviour, Urban Data Analytics, Logistics and Supply Chain
Jun Shen
University of Wollongong, Wollongong, Australia
Qiang Wu
Lanzhou University, Lanzhou, China