๐ค AI Summary
To address the significant performance degradation of reinforcement learning (RL) policies when transferring from simulation to real-world autonomous vehicles, this paper proposes a GPU-accelerated parallel simulation training framework incorporating systematic domain randomization, specifically designed for drift control under extreme maneuvering conditions. Methodologically, it integrates deep RL with high-fidelity multibody dynamics modeling and cross-physical-parameter domain randomization to enhance policy robustness in simulation. The primary contributions are: (i) the first successful end-to-end sim-to-real transfer of stable drift control on a custom-built, open-source 1/10-scale RC platform featuring independent wheel drive; and (ii) experimental validation demonstrating precise tracking of fixed-radius and variable-curvature trajectories, seamless steering-mode switching, and stable regulation of sideslip angleโthereby substantially narrowing the sim-to-real performance gap.
๐ Abstract
Drifting, characterized by controlled vehicle motion at high sideslip angles, is crucial for safely handling emergency scenarios at the friction limits. While recent reinforcement learning approaches show promise for drifting control, they struggle with the significant simulation-to-reality gap, as policies that perform well in simulation often fail when transferred to physical systems. In this paper, we present a reinforcement learning framework with GPU-accelerated parallel simulation and systematic domain randomization that effectively bridges the gap. The proposed approach is validated on both simulation and a custom-designed and open-sourced 1/10 scale Individual Wheel Drive (IWD) RC car platform featuring independent wheel speed control. Experiments across various scenarios from steady-state circular drifting to direction transitions and variable-curvature path following demonstrate that our approach achieves precise trajectory tracking while maintaining controlled sideslip angles throughout complex maneuvers in both simulated and real-world environments.