SARAD: LLM-Based Safety-Aware Hybrid Reinforcement Learning with Collision Prediction for Autonomous Driving

📅 2026-05-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes SARAD, a novel framework that integrates large language models (LLMs) with deep reinforcement learning (DRL) to address the challenge of balancing safety and efficiency in autonomous driving decision-making. SARAD leverages retrieval-augmented generation (RAG) to produce guided actions from a dynamic expert knowledge base, replacing unsafe random exploration during training. An attention-based discriminator incorporates LLM-derived prior knowledge into policy optimization, while a dedicated collision prediction module—fine-tuned on historical collision data—further enhances safety. Evaluated in the Highway-Env simulation environment, SARAD demonstrates significantly improved driving performance by simultaneously ensuring safety and increasing decision-making efficiency, thereby validating its effectiveness.
📝 Abstract
Ensuring both safety and efficiency in decision-making for autonomous driving systems remains a fundamental challenge. Traditional Deep Reinforcement Learning (DRL) suffers from unsafe random exploration and slow convergence, while Large Language Models (LLMs) demonstrate inherent latency in real-time inference operations. To address these limitations, this paper proposes SARAD, a novel safety-aware hybrid framework that synergizes LLMs and DRL for autonomous driving. SARAD substitutes the random exploration of DRL with Retrieval-Augmented Generation (RAG)-enhanced, LLM-guided decisions sourced from a dynamic expert knowledge repository. An attention discriminator is proposed to integrate the prior knowledge of LLMs into DRL policy optimization. A collision predictor module, fine-tuned with historical collision data, is further designed to improve vehicle safety. Extensive experiments show that SARAD achieves significant performance improvements in the Highway-Env simulator, validating the effectiveness of the proposed model in autonomous driving.
Problem

Research questions and friction points this paper is trying to address.

autonomous driving
safety
reinforcement learning
Large Language Models
collision prediction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid Reinforcement Learning
Large Language Models
Retrieval-Augmented Generation
Collision Prediction
Safety-Aware Decision Making
🔎 Similar Papers
No similar papers found.
K
Kangyu Wu
School of Automation, Southeast University, Nanjing, China; Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Nanjing, China
P
Peng Cui
School of Automation, Southeast University, Nanjing, China; Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Nanjing, China
G
Guoxi Chen
School of Automation, Southeast University, Nanjing, China; Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Nanjing, China
Ya Zhang
Ya Zhang
Shanghai Jiao Tong University
Machine learningComputer visionMedical Imaging