SanDRA: Safe Large-Language-Model-Based Decision Making for Automated Vehicles Using Reachability Analysis

📅 2025-10-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) exhibit hallucination risks and lack explicit vehicle dynamics modeling, undermining safety and regulatory compliance in knowledge-driven autonomous driving decision-making. Method: This paper proposes the first decision-making framework integrating LLMs with formal reachability analysis. It encodes LLM-generated and ranked driving maneuvers as temporal logic formulas, then jointly verifies them against traffic regulations and a precise vehicle dynamics model via reachability analysis—enabling provably safe and compliant decisions. Contribution/Results: The method achieves tight coupling between symbolic knowledge reasoning and rigorous mathematical verification for the first time. It guarantees both safety and legal compliance in high-density traffic scenarios, supporting both open-loop and closed-loop verification. To foster reproducibility and further research, the implementation code and experimental configurations are publicly released.

Technology Category

Application Category

📝 Abstract
Large language models have been widely applied to knowledge-driven decision-making for automated vehicles due to their strong generalization and reasoning capabilities. However, the safety of the resulting decisions cannot be ensured due to possible hallucinations and the lack of integrated vehicle dynamics. To address this issue, we propose SanDRA, the first safe large-language-model-based decision making framework for automated vehicles using reachability analysis. Our approach starts with a comprehensive description of the driving scenario to prompt large language models to generate and rank feasible driving actions. These actions are translated into temporal logic formulas that incorporate formalized traffic rules, and are subsequently integrated into reachability analysis to eliminate unsafe actions. We validate our approach in both open-loop and closed-loop driving environments using off-the-shelf and finetuned large language models, showing that it can provide provably safe and, where possible, legally compliant driving actions, even under high-density traffic conditions. To ensure transparency and facilitate future research, all code and experimental setups are publicly available at github.com/CommonRoad/SanDRA.
Problem

Research questions and friction points this paper is trying to address.

Ensuring safety in LLM-based decision making for automated vehicles
Addressing hallucinations and lack of vehicle dynamics integration
Providing provably safe and legally compliant driving actions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses reachability analysis to eliminate unsafe actions
Translates LLM decisions into temporal logic formulas
Ensures provably safe driving under dense traffic
🔎 Similar Papers
No similar papers found.
Y
Yuanfei Lin
Department of Computer Engineering, Technical University of Munich, 85748 Garching, Germany
S
Sebastian Illing
Department of Computer Engineering, Technical University of Munich, 85748 Garching, Germany
Matthias Althoff
Matthias Althoff
Associate Professor in Computer Science, Technische Universität München
Cyber-Physical SystemsFormal VerificationReachability AnalysisRobotics and Automated Driving