Navigating the Clutter: Waypoint-Based Bi-Level Planning for Multi-Robot Systems

๐Ÿ“… 2026-04-22
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

216K/year
๐Ÿค– AI Summary
This work addresses the challenge of multi-robot coordination in cluttered environments, where physical constraints such as collisions and kinematic infeasibility hinder effective collaboration. To tackle this, the authors propose a hybrid two-layer planning framework that jointly optimizes high-level task allocation and low-level motion planning. The key innovation lies in a concise waypoint-based trajectory parameterization and a curriculum learningโ€“inspired credit assignment mechanism that efficiently propagates motion feasibility feedback from the planning layer to the task layer. This integration is further enhanced by an improved RLVR reinforcement learning algorithm, enabling end-to-end joint optimization. Experiments on the BoxNet3D-OBS benchmark demonstrate that the proposed method significantly outperforms motion-agnostic and VLA baselines, consistently achieving higher task success rates in complex scenarios with dense obstacles and up to nine robots.

Technology Category

Application Category

๐Ÿ“ Abstract
Multi-robot control in cluttered environments is a challenging problem that involves complex physical constraints, including robot-robot collisions, robot-obstacle collisions, and unreachable motions. Successful planning in such settings requires joint optimization over high-level task planning and low-level motion planning, as violations of physical constraints may arise from failures at either level. However, jointly optimizing task and motion planning is difficult due to the complex parameterization of low-level motion trajectories and the ambiguity of credit assignment across the two planning levels. In this paper, we propose a hybrid multi-robot control framework that jointly optimizes task and motion planning. To enable effective parameterization of low-level planning, we introduce waypoints, a simple yet expressive representation for motion trajectories. To address the credit assignment challenge, we adopt a curriculum-based training strategy with a modified RLVR algorithm that propagates motion feasibility feedback from the motion planner to the task planner. Experiments on BoxNet3D-OBS, a challenging multi-robot benchmark with dense obstacles and up to nine robots, show that our approach consistently improves task success over motion-agnostic and VLA-based baselines. Our code is available at https://github.com/UCSB-NLP-Chang/navigate-cluster
Problem

Research questions and friction points this paper is trying to address.

multi-robot systems
cluttered environments
task and motion planning
physical constraints
collision avoidance
Innovation

Methods, ideas, or system contributions that make the work stand out.

waypoint-based planning
bi-level planning
multi-robot systems
task and motion planning
curriculum-based reinforcement learning