Efficient and Generalized end-to-end Autonomous Driving System with Latent Deep Reinforcement Learning and Demonstrations

📅 2024-01-22

📈 Citations: 3

✨ Influential: 0

career value

236K/year

🤖 AI Summary

To address the high sample complexity, poor generalization, and insufficient safety guarantees of existing end-to-end autonomous driving approaches, this paper proposes EGADS—a latent-space deep reinforcement learning framework tailored for complex urban driving scenarios. Methodologically, EGADS integrates variational inference with normalizing flows to establish a prior-free RL paradigm; introduces a robust safety-constrained decision-making mechanism to ensure reliability; and constructs a professional driving demonstration dataset collected via a Logitech G29 steering wheel. Leveraging latent-space modeling, VAE-based representation learning, and joint imitation-reinforcement optimization, EGADS substantially reduces training sample requirements. Empirical evaluation demonstrates significant improvements in safety performance within urban environments and superior cross-scenario generalization compared to state-of-the-art end-to-end methods.

Technology Category

Application Category

📝 Abstract

An intelligent driving system should dynamically formulate appropriate driving strategies based on the current environment and vehicle status while ensuring system security and reliability. However, methods based on reinforcement learning and imitation learning often suffer from high sample complexity, poor generalization, and low safety. To address these challenges, this paper introduces an Efficient and Generalized end-to-end Autonomous Driving System (EGADS) for complex and varied scenarios. The RL agent in our EGADS combines variational inference with normalizing flows, which are independent of distribution assumptions. This combination allows the agent to capture historical information relevant to driving in latent space effectively, thereby significantly reducing sample complexity. Additionally, we enhance safety by formulating robust safety constraints and improve generalization and performance by integrating RL with expert demonstrations. Experimental results demonstrate that, compared to existing methods, EGADS significantly reduces sample complexity, greatly improves safety performance, and exhibits strong generalization capabilities in complex urban scenarios. Particularly, we contributed an expert dataset collected through human expert steering wheel control, specifically using the G29 steering wheel.

Problem

Research questions and friction points this paper is trying to address.

High sample complexity in reinforcement learning for autonomous driving

Poor generalization of driving systems in varied scenarios

Low safety performance in end-to-end autonomous driving

Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines variational inference with normalizing flows

Integrates RL with expert demonstrations

Formulates robust safety constraints

🔎 Similar Papers

Enhancing End-to-End Autonomous Driving with Latent World Model