Learning Multi-Stage Pick-and-Place with a Legged Mobile Manipulator

📅 2025-09-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of deploying quadrupedal mobile manipulators to perform multi-stage pick-and-place tasks (search → approach → grasp → transport → place) in partially observable, long-horizon, cross-indoor-outdoor real-world environments. We propose an end-to-end vision–motor policy framework trained entirely in simulation. Our method integrates hierarchical task decomposition, domain-randomized reinforcement learning, and self-supervised sim-to-real transfer. Key contributions include: (i) the first emergence of robust behaviors—such as regrasping and task chaining—in quadrupedal mobile manipulation; (ii) strong generalization to complex, unseen environments; and (iii) zero-shot deployment without real-world fine-tuning. Experiments demonstrate ≈80% task success rate in real-world settings. Ablation studies confirm the efficacy of each technical component.

Technology Category

Application Category

📝 Abstract
Quadruped-based mobile manipulation presents significant challenges in robotics due to the diversity of required skills, the extended task horizon, and partial observability. After presenting a multi-stage pick-and-place task as a succinct yet sufficiently rich setup that captures key desiderata for quadruped-based mobile manipulation, we propose an approach that can train a visuo-motor policy entirely in simulation, and achieve nearly 80% success in the real world. The policy efficiently performs search, approach, grasp, transport, and drop into actions, with emerged behaviors such as re-grasping and task chaining. We conduct an extensive set of real-world experiments with ablation studies highlighting key techniques for efficient training and effective sim-to-real transfer. Additional experiments demonstrate deployment across a variety of indoor and outdoor environments. Demo videos and additional resources are available on the project page: https://horizonrobotics.github.io/gail/SLIM.
Problem

Research questions and friction points this paper is trying to address.

Learning multi-stage pick-and-place with legged mobile manipulator
Addressing partial observability and extended task horizons
Achieving effective sim-to-real transfer for visuo-motor policies
Innovation

Methods, ideas, or system contributions that make the work stand out.

Simulation-trained visuo-motor policy
Multi-stage pick-and-place behavior
Effective sim-to-real transfer technique
🔎 Similar Papers
No similar papers found.