A Champion-Level Vision-Based Reinforcement Learning Agent for Competitive Racing in Gran Turismo 7

📅 2025-04-12
🏛️ IEEE Robotics and Automation Letters
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of achieving high-performance autonomous racing using only onboard camera and sensor data—without external positioning systems—under realistic conditions. Method: We propose the first end-to-end vision-driven reinforcement learning racing agent, trained in the high-fidelity Gran Turismo 7 simulator. Our approach features an asymmetric Actor-Critic architecture: the Actor operates solely on local sensory inputs, while the Critic leverages global information during training to improve policy quality. The agent integrates an RNN-based policy network, a lightweight visual encoder, a self-supervised trajectory prediction module, and an asynchronous distributed PPO framework. Results: The agent consistently outperforms all built-in GT7 AI opponents and achieves performance comparable to professional human racers. Its core contribution is the empirical validation of high-fidelity autonomous racing feasibility using purely onboard perception, establishing a transferable technical paradigm for real-world track-based autonomous driving.

Technology Category

Application Category

📝 Abstract
Deep reinforcement learning has achieved superhuman racing performance in high-fidelity simulators like Gran Turismo 7 (GT7). It typically utilizes global features that require instrumentation external to a car, such as precise localization of agents and opponents, limiting real-world applicability. To address this limitation, we introduce a vision-based autonomous racing agent that relies solely on ego-centric camera views and onboard sensor data, eliminating the need for precise localization during inference. This agent employs an asymmetric actor-critic framework: the actor uses a recurrent neural network with the sensor data local to the car to retain track layouts and opponent positions, while the critic accesses the global features during training. Evaluated in GT7, our agent consistently outperforms GT7's built-drivers. To our knowledge, this work presents the first vision-based autonomous racing agent to demonstrate champion-level performance in competitive racing scenarios.
Problem

Research questions and friction points this paper is trying to address.

Develops vision-based agent for autonomous racing without external instrumentation
Uses ego-centric camera views and onboard sensors for real-world applicability
Achieves champion-level performance in competitive racing scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Vision-based agent using ego-centric camera views
Asymmetric actor-critic framework with recurrent network
Champion-level performance without precise localization
🔎 Similar Papers
No similar papers found.
H
Hojoon Lee
KAIST, Daejeon, South Korea
Takuma Seno
Takuma Seno
Turing Inc.
Deep reinforcement learningDeep learning
J
Jun Jet Tai
Coventry University, Coventry, UK
K
Kaushik Subramanian
Sony AI, Zürich, Switzerland
Kenta Kawamoto
Kenta Kawamoto
Sony AI Inc.
Machine LearningRoboticsDevelopmental Intelligence
P
Peter Stone
Sony AI, New York, USA; University of Texas at Austin, USA
P
Peter R. Wurman
Sony AI, New York, USA