🤖 AI Summary
To address low sample efficiency, poor generalization, and difficult sim-to-real transfer for wheeled robots on vertically challenging terrain, this paper proposes VertiSelector (VS), an automated curriculum learning framework. The method trains reinforcement learning policies in VW-Chrono, a high-fidelity simulator built on the Chrono multi-physics engine, and is validated on the real-world Verti-4-Wheeler platform. Its core contribution is a Temporal Difference (TD) error-driven terrain sampling mechanism that prioritizes terrain with high TD errors when revisited, keeping the robot learning at the edge of its evolving capabilities and generating a progressive curriculum without manual environment sequencing. Experiments demonstrate a 23.08% improvement in success rate, along with robust generalization from simulation to physical deployment.
📝 Abstract
Reinforcement Learning (RL) has the potential to enable extreme off-road mobility by circumventing complex kinodynamic modeling, planning, and control through simulated end-to-end trial-and-error learning experiences. However, most RL methods are sample-inefficient when training across a large number of manually designed simulation environments and struggle to generalize to the real world. To address these issues, we introduce VertiSelector (VS), an automatic curriculum learning framework designed to enhance learning efficiency and generalization by selectively sampling training terrain. VS prioritizes vertically challenging terrain with higher Temporal Difference (TD) errors when revisited, thereby allowing robots to learn at the edge of their evolving capabilities. By dynamically adjusting the sampling focus, VS significantly boosts sample efficiency and generalization within the VW-Chrono simulator built on the Chrono multi-physics engine. Furthermore, we present simulation and physical results using VS on a Verti-4-Wheeler platform. These results demonstrate that VS achieves a 23.08% improvement in success rate by sampling efficiently during training and generalizing robustly to the real world.
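The core idea of prioritizing terrain by TD error can be illustrated with a minimal sketch. This is an assumption-laden toy version, not the paper's actual implementation: the class name `TerrainSelector`, the softmax weighting, and the per-terrain update rule are all hypothetical stand-ins for whatever VS does internally.

```python
import math
import random

class TerrainSelector:
    """Toy sketch of TD-error-prioritized terrain sampling.

    Terrains whose most recent revisit produced a large |TD error| are
    sampled more often, so training concentrates on terrain at the edge
    of the policy's current capabilities. All details here (optimistic
    initialization, softmax with temperature) are illustrative guesses.
    """

    def __init__(self, num_terrains: int, temperature: float = 1.0):
        # Optimistic initialization so every terrain gets sampled early on.
        self.td_errors = [1.0] * num_terrains
        self.temperature = temperature

    def sample(self) -> int:
        # Softmax over stored TD errors: higher error -> higher probability.
        scaled = [e / self.temperature for e in self.td_errors]
        m = max(scaled)  # subtract max for numerical stability
        weights = [math.exp(s - m) for s in scaled]
        return random.choices(range(len(weights)), weights=weights, k=1)[0]

    def update(self, terrain_idx: int, episode_td_error: float) -> None:
        # After revisiting a terrain, record the magnitude of the TD error
        # observed there; this drives the next round of sampling.
        self.td_errors[terrain_idx] = abs(episode_td_error)


# Usage: terrains the policy still predicts poorly dominate the curriculum.
selector = TerrainSelector(num_terrains=3)
selector.update(0, 5.0)   # hard terrain: large TD error
selector.update(1, 0.1)   # mastered terrain: small TD error
selector.update(2, 0.1)
chosen = selector.sample()  # index 0 is now the most likely draw
```

In a full training loop, `update` would be called with TD errors logged by the RL learner each time a terrain is revisited, so the sampling distribution tracks the policy's evolving capability boundary.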