LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving

📅 2025-07-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Urban autonomous driving faces core challenges in complex and edge-case scenarios, including insufficient semantic understanding, biased intent recognition, and decision-making logic diverging from human driving norms. To address these, we propose a dual-rate decision architecture integrating large language models (LLMs): a high-frequency end-to-end module ensures real-time control, while a low-frequency semantic module leverages LLMs for multimodal perception fusion and chain-of-thought reasoning—significantly enhancing situational awareness and robustness in unconventional scenarios. Crucially, we introduce LLMs into the planning closed loop, enabling human-aligned, robust decisions when conventional planners fail. Furthermore, we enhance end-to-end training via high-definition map integration and imitation learning. Evaluated on the CARLA Leaderboard V1 benchmark, our method achieves a score of 71 and a route completion rate of 93%, outperforming state-of-the-art approaches.

Technology Category

Application Category

📝 Abstract
A principal barrier to large-scale deployment of urban autonomous driving systems lies in the prevalence of complex scenarios and edge cases. Existing systems fail to effectively interpret semantic information within traffic contexts and discern intentions of other participants, consequently generating decisions misaligned with skilled drivers' reasoning patterns. We present LeAD, a dual-rate autonomous driving architecture integrating imitation learning-based end-to-end (E2E) frameworks with large language model (LLM) augmentation. The high-frequency E2E subsystem maintains real-time perception-planning-control cycles, while the low-frequency LLM module enhances scenario comprehension through multi-modal perception fusion with HD maps and derives optimal decisions via chain-of-thought (CoT) reasoning when baseline planners encounter capability limitations. Our experimental evaluation in the CARLA Simulator demonstrates LeAD's superior handling of unconventional scenarios, achieving 71 points on Leaderboard V1 benchmark, with a route completion of 93%.
Problem

Research questions and friction points this paper is trying to address.

Handling complex urban driving scenarios and edge cases
Interpreting semantic traffic information and participant intentions
Aligning decisions with skilled driver reasoning patterns
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-rate architecture with E2E and LLM
LLM enhances scenario comprehension via multi-modal fusion
Chain-of-thought reasoning optimizes decision-making
🔎 Similar Papers
No similar papers found.
Y
Yuhang Zhang
College of Transportation, Tongji University, Shanghai 201804, China
J
Jiaqi Liu
College of Transportation, Tongji University, Shanghai 201804, China
Chengkai Xu
Chengkai Xu
Tongji University
Autonomous DrivingReinforcement LearningMulti-modal Model
Peng Hang
Peng Hang
Tongji University
autonomous vehiclecooperative drivingdecision making and controlintelligent transportation
J
Jian Sun
College of Transportation, Tongji University, Shanghai 201804, China