🤖 AI Summary
Traffic safety faces a fundamental data paradox: high-consequence crashes (e.g., fatalities and severe injuries) are sparse, underreported, and difficult to model, leading to poor generalizability of conventional crash-frequency models and surrogate safety metrics—and consequent failure in long-tail risk prediction. To address this, we propose “counterfactual safety learning,” a novel paradigm that leverages *unrealized high-risk near-misses* as primary supervisory signals. We introduce a macroscopic statistical prior-guided microscopic scenario generation mechanism, integrating multi-agent driving behavior modeling, causal inference, and a generative scenario engine within a digital twin platform to augment risk-relevant signals. The resulting framework significantly improves long-tail crash prediction accuracy and enables pre-deployment stress testing for vehicle–infrastructure–control co-design. This advances traffic safety from reactive root-cause analysis toward proactive risk prevention.
📝 Abstract
Traffic safety science has long been hindered by a fundamental data paradox: the crashes we most wish to prevent are precisely those events we rarely observe. Existing crash-frequency models and surrogate safety metrics rely heavily on sparse, noisy, and under-reported records, while even sophisticated, high-fidelity simulations undersample the long-tailed situations that trigger catastrophic outcomes such as fatalities. We argue that the path to achieving Vision Zero, i.e., the complete elimination of traffic fatalities and severe injuries, requires a paradigm shift from traditional crash-only learning to a new form of counterfactual safety learning: reasoning not only about what happened, but also about the vast set of plausible yet perilous scenarios that could have happened under slightly different circumstances. To operationalize this shift, our proposed agenda bridges macro to micro. Guided by crash-rate priors, generative scene engines, diverse driver models, and causal learning, near-miss events are synthesized and explained. A crash-focused digital twin testbed links micro scenes to macro patterns, while a multi-objective validator ensures that simulations maintain statistical realism. This pipeline transforms sparse crash data into rich signals for crash prediction, enabling the stress-testing of vehicles, roads, and policies before deployment. By learning from crashes that almost happened, we can shift traffic safety from reactive forensics to proactive prevention, advancing Vision Zero.