π€ AI Summary
This work addresses the challenge of enabling robots to generalize vision-driven manipulation skills to unseen objects in open-world environments from a single human RGB-D demonstration video. To this end, we propose ORIONβa novel one-shot imitation learning framework grounded in an open-world object graph, requiring no predefined object categories or environmental priors. Its core components include object-centric modeling, RGB-D video parsing, manipulation graph extraction, conditional policy learning, and multimodal representation alignment. ORION achieves strong generalization across varying backgrounds, viewpoints, scene layouts, and previously unseen object instances, enabling robust operation planning and policy transfer. Experiments demonstrate that ORION significantly outperforms existing baselines on both short- and long-horizon tasks. It supports real-world deployment using consumer-grade devices (e.g., iPad) and successfully transfers policies to diverse physical environments, accomplishing zero-shot manipulation of novel objects.
π Abstract
We present an object-centric approach to empower robots to learn vision-based manipulation skills from human videos. We investigate the problem of imitating robot manipulation from a single human video in the open-world setting, where a robot must learn to manipulate novel objects from one video demonstration. We introduce ORION, an algorithm that tackles the problem by extracting an object-centric manipulation plan from a single RGB-D video and deriving a policy that conditions on the extracted plan. Our method enables the robot to learn from videos captured by daily mobile devices such as an iPad and generalize the policies to deployment environments with varying visual backgrounds, camera angles, spatial layouts, and novel object instances. We systematically evaluate our method on both short-horizon and long-horizon tasks, demonstrating the efficacy of ORION in learning from a single human video in the open world. Videos can be found in the project website https://ut-austin-rpl.github.io/ORION-release.