COIN: Collaborative Interaction-Aware Multi-Agent Reinforcement Learning for Self-Driving Systems

📅 2026-03-25
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work addresses efficient and safe collaboration in dense, dynamic multi-agent self-driving scenarios. The authors propose COIN, a collaborative interaction-aware framework trained under the centralized training with decentralized execution (CTDE) paradigm, which jointly optimizes individual navigation objectives and global collaboration objectives. Its two key components are a counterfactual individual-global twin delayed deep deterministic policy gradient algorithm (CIG-TD3) and a dual-level interaction-aware centralized critic that combines local pairwise interactions with global system-level dependencies to improve global value estimation and credit assignment. In simulated dense urban traffic, COIN consistently outperforms advanced baseline methods in both safety and efficiency across various system sizes; real-world robot demonstrations further validate the approach.

📝 Abstract
Multi-Agent Self-Driving (MASD) systems provide an effective solution for coordinating autonomous vehicles to reduce congestion and enhance both safety and operational efficiency in future intelligent transportation systems. Multi-Agent Reinforcement Learning (MARL) has emerged as a promising approach for developing advanced end-to-end MASD systems. However, achieving efficient and safe collaboration in dynamic MASD systems remains a significant challenge in dense scenarios with complex agent interactions. To address this challenge, we propose a novel collaborative (CO-) interaction-aware (-IN) MARL framework, named COIN. Specifically, we develop a new counterfactual individual-global twin delayed deep deterministic policy gradient (CIG-TD3) algorithm, crafted in a "centralized training, decentralized execution" (CTDE) manner, which aims to jointly optimize the individual objectives (navigation) and the global objectives (collaboration) of agents. We further introduce a dual-level interaction-aware centralized critic architecture that captures both local pairwise interactions and global system-level dependencies, enabling more accurate global value estimation and improved credit assignment for collaborative policy learning. We conduct extensive simulation experiments in dense urban traffic environments, which demonstrate that COIN consistently outperforms other advanced baseline methods in both safety and efficiency across various system sizes. These results highlight its superiority in complex and dynamic MASD scenarios, as further validated through real-world robot demonstrations. Supplementary videos are available at https://marmotlab.github.io/COIN/
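As a rough illustration of two generic ideas the abstract builds on, the sketch below shows the clipped double-Q target from standard TD3 and a counterfactual credit signal in the style of COMA-type methods. It is not the authors' CIG-TD3: the function names, the `default_action` baseline, and the toy critic (a per-agent term minus a pairwise penalty) are all hypothetical and chosen only to keep the example self-contained.

```python
from typing import Callable, List, Sequence


def td3_target(reward: float, done: float, q1_next: float, q2_next: float,
               gamma: float = 0.99) -> float:
    """Standard TD3 target: bootstrap from the smaller of the twin critic estimates."""
    return reward + gamma * (1.0 - done) * min(q1_next, q2_next)


def counterfactual_advantage(critic: Callable[[object, Sequence[float]], float],
                             state: object,
                             joint_action: Sequence[float],
                             agent: int,
                             default_action: float) -> float:
    """Joint value minus the value obtained when only `agent`'s action is
    swapped for a default action (an assumed baseline, not the paper's)."""
    counterfactual: List[float] = list(joint_action)
    counterfactual[agent] = default_action
    return critic(state, joint_action) - critic(state, counterfactual)


def toy_critic(state: object, joint_action: Sequence[float]) -> float:
    """Hypothetical centralized critic: sum of per-agent terms minus a
    penalty on pairwise action differences. The state is ignored here."""
    individual = sum(joint_action)
    pairwise = sum(abs(a - b)
                   for i, a in enumerate(joint_action)
                   for b in joint_action[i + 1:])
    return individual - 0.25 * pairwise


if __name__ == "__main__":
    print(td3_target(reward=1.0, done=0.0, q1_next=2.0, q2_next=3.0, gamma=0.5))
    print(counterfactual_advantage(toy_critic, None, [1.0, 1.0, 3.0], 2, 1.0))
```

Under centralized training, a critic of this kind sees the joint action of all agents, while each actor at execution time would act only on its own observation; the counterfactual term estimates how much one agent's choice contributed to the joint value.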
Problem

Research questions and friction points this paper is trying to address.

Multi-Agent Self-Driving
Collaboration
Complex Agent Interactions
Safety and Efficiency
Dense Traffic Scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-agent reinforcement learning
interaction-aware
centralized critic
credit assignment
autonomous driving
Yifeng Zhang
Department of Mechanical Engineering, College of Design and Engineering, National University of Singapore, 21 Lower Kent Ridge Rd, 119077, Singapore
Jieming Chen
Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University, Hong Kong, 100872, China
Tingguang Zhou
Department of Mechanical Engineering, College of Design and Engineering, National University of Singapore, 21 Lower Kent Ridge Rd, 119077, Singapore
Tanishq Duhan
Student, BITS Pilani
Robotics, Multi Agent Systems
Jianghong Dong
School of Vehicle and Mobility, Tsinghua University, Beijing, 100084, China
Yuhong Cao
National University of Singapore
Robot learning, Path Planning
Guillaume Sartoretti
Assistant Professor, National University of Singapore (NUS), Mechanical Engineering Dpt
Multi-Agent Systems, Robotics, Swarm Intelligence, Distributed Control, Distributed Learning