🤖 AI Summary
To address the low safety and inefficiency of manual operation of micro-mobility devices (e.g., delivery robots, e-scooters) in complex urban environments, this paper introduces URBAN-SIM—a high-fidelity, scalable, physics-engine-driven urban simulation platform enabling large-scale embodied reinforcement learning for multi-modal micro-robots (e.g., wheeled, legged). Methodologically, it innovatively integrates hierarchical city generation, interactive dynamics modeling, and asynchronous scenario sampling. It also proposes URBAN-BENCH—the first systematic, three-dimensional autonomy benchmark encompassing traversal, navigation, and crossing—enabling fair cross-platform evaluation across robot morphologies. Extensive experiments across diverse urban terrains evaluate eight tasks, revealing performance boundaries and adaptive behavioral patterns of distinct robot configurations under dynamic human-robot coexistence. Results demonstrate URBAN-SIM’s capacity to support robust, generalizable policy learning and provide empirical insights into morphology-dependent autonomy limitations in realistic urban settings.
📝 Abstract
Micromobility, which utilizes lightweight mobile machines moving in urban public spaces, such as delivery robots and mobility scooters, emerges as a promising alternative to vehicular mobility. Current micromobility depends mostly on human manual operation (in-person or remote control), which raises safety and efficiency concerns when navigating busy urban environments full of unpredictable obstacles and pedestrians. Assisting humans with AI agents in maneuvering micromobility devices presents a viable solution for enhancing safety and efficiency. In this work, we present a scalable urban simulation solution to advance autonomous micromobility. First, we build URBAN-SIM - a high-performance robot learning platform for large-scale training of embodied agents in interactive urban scenes. URBAN-SIM contains three critical modules: Hierarchical Urban Generation pipeline, Interactive Dynamics Generation strategy, and Asynchronous Scene Sampling scheme, to improve the diversity, realism, and efficiency of robot learning in simulation. Then, we propose URBAN-BENCH - a suite of essential tasks and benchmarks to gauge various capabilities of the AI agents in achieving autonomous micromobility. URBAN-BENCH includes eight tasks based on three core skills of the agents: Urban Locomotion, Urban Navigation, and Urban Traverse. We evaluate four robots with heterogeneous embodiments, such as the wheeled and legged robots, across these tasks. Experiments on diverse terrains and urban structures reveal each robot's strengths and limitations.