Interact, Instruct to Improve: A LLM-Driven Parallel Actor-Reasoner Framework for Enhancing Autonomous Vehicle Interactions

๐Ÿ“… 2025-03-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Real-time, interpretable bidirectional interaction between autonomous vehicles (AVs) and human-driven vehicles (HVs) remains hindered by weak intent expressivity, high response latency, and poor generalization across diverse human drivers. To address these challenges, we propose the Parallel Actor-Reasoner (PAR) framework: a novel LLM-driven Reasoner module enables semantic-level intent inference, while a memory-augmented Actor module ensures low-latency eHMI generation. We further introduce memory partitioning and a two-tier retrieval mechanismโ€”first enabling generalized interaction modeling across heterogeneous HV drivers. Evaluated in multi-agent simulation and real-world vehicle testing, PAR significantly improves intersection throughput (+23.6%) and interaction safety (41.2% reduction in conflict rate). The implementation is publicly available.

Technology Category

Application Category

๐Ÿ“ Abstract
Autonomous Vehicles (AVs) have entered the commercialization stage, but their limited ability to interact and express intentions still poses challenges in interactions with Human-driven Vehicles (HVs). Recent advances in large language models (LLMs) enable bidirectional human-machine communication, but the conflict between slow inference speed and the need for real-time decision-making challenges practical deployment. To address these issues, this paper introduces a parallel Actor-Reasoner framework designed to enable explicit bidirectional AV-HV interactions across multiple scenarios. First, by facilitating interactions between the LLM-driven Reasoner and heterogeneous simulated HVs during training, an interaction memory database, referred to as the Actor, is established. Then, by introducing the memory partition module and the two-layer memory retrieval module, the Actor's ability to handle heterogeneous HVs is significantly enhanced. Ablation studies and comparisons with other decision-making methods demonstrate that the proposed Actor-Reasoner framework significantly improves safety and efficiency. Finally, with the combination of the external Human-Machine Interface (eHMI) information derived from Reasoner's reasoning and the feasible action solutions retrieved from the Actor, the effectiveness of the proposed Actor-Reasoner is confirmed in multi-scenario field interactions. Our code is available at https://github.com/FanGShiYuu/Actor-Reasoner.
Problem

Research questions and friction points this paper is trying to address.

Enhance AV-HV interaction using LLM-driven framework
Address slow inference speed for real-time decision-making
Improve safety and efficiency in multi-scenario interactions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Parallel Actor-Reasoner framework for AV-HV interactions
LLM-driven Reasoner with interaction memory database
Memory partition and two-layer retrieval modules
๐Ÿ”Ž Similar Papers
No similar papers found.
Shiyu Fang
Shiyu Fang
Tongji University
J
Jiaqi Liu
College of Transportation, Tongji University, Shanghai 201804, China, and the State Key Lab of Intelligent Transportation System, Beijing 100088, China
Chengkai Xu
Chengkai Xu
Tongji University
Autonomous DrivingReinforcement LearningMulti-modal Model
C
Chen Lv
Nanyang Technological University, 639798, Singapore
Peng Hang
Peng Hang
Tongji University
autonomous vehiclecooperative drivingdecision making and controlintelligent transportation
J
Jian Sun
College of Transportation, Tongji University, Shanghai 201804, China, and the State Key Lab of Intelligent Transportation System, Beijing 100088, China