Learning a thousand tasks in a day.

📅 2025-11-12

🏛️ Science Robotics

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

To address the high demonstration requirements and poor cross-task generalization of robotic imitation learning, this paper proposes Multi-Task Trajectory Transfer (MT3). The method decomposes manipulation trajectories into two sequential phases, "alignment" and "interaction," and uses retrieval to transfer knowledge across tasks and object instances from as little as one demonstration per task. MT3 builds on trajectory decomposition and retrieval, which the paper compares against behavioral cloning with a single-phase monolithic policy. On a real-world robot platform, MT3 learned 1,000 distinct everyday tasks from under 24 hours of human demonstrator time. In the few-demonstration regime (fewer than 10 per task), decomposition improved data efficiency by an order of magnitude over single-phase learning.

📝 Abstract

Humans are remarkably efficient at learning tasks from demonstrations, but today's imitation learning methods for robot manipulation often require hundreds or thousands of demonstrations per task. We investigated two fundamental priors for improving learning efficiency: decomposing manipulation trajectories into sequential alignment and interaction phases and retrieval-based generalization. Through 3450 real-world rollouts, we systematically studied this decomposition. We compared different design choices for the alignment and interaction phases and examined generalization and scaling trends relative to today's dominant paradigm of behavioral cloning with a single-phase monolithic policy. In the few-demonstrations-per-task regime (<10 demonstrations), decomposition achieved an order of magnitude of improvement in data efficiency over single-phase learning, with retrieval consistently outperforming behavioral cloning for both alignment and interaction. Building on these insights, we developed Multi-Task Trajectory Transfer (MT3), an imitation learning method based on decomposition and retrieval. MT3 learns everyday manipulation tasks from as little as a single demonstration each while also generalizing to previously unseen object instances. This efficiency enabled us to teach a robot 1000 distinct everyday tasks in under 24 hours of human demonstrator time. Through 2200 additional real-world rollouts, we reveal MT3's capabilities and limitations across different task families.

Problem

Research questions and friction points this paper is trying to address.

Improving robot imitation learning efficiency from few demonstrations

Developing decomposition methods for manipulation trajectory phases

Enabling generalization to novel objects with single demonstrations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Decomposes trajectories into alignment and interaction phases

Uses retrieval-based generalization for improved data efficiency

Learns tasks from single demonstrations with novel object generalization
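
The two ideas above, retrieval and the alignment/interaction split, can be illustrated with a minimal sketch. It is not MT3's implementation, which this summary does not describe. It assumes a hypothetical setup in which each demonstration stores a feature vector (`embedding`), the end-effector pose at which interaction begins relative to the object (`start_in_object`), and the interaction trajectory relative to that starting pose. Retrieval here is plain cosine similarity, and all names are illustrative.

```python
from dataclasses import dataclass
from math import sqrt

Pose = list[list[float]]  # 4x4 homogeneous transform, row-major


def translation(x: float, y: float, z: float) -> Pose:
    """Build a pure-translation transform (rotation left as identity)."""
    return [[1.0, 0.0, 0.0, x],
            [0.0, 1.0, 0.0, y],
            [0.0, 0.0, 1.0, z],
            [0.0, 0.0, 0.0, 1.0]]


def compose(a: Pose, b: Pose) -> Pose:
    """Matrix product a @ b of two 4x4 transforms."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def cosine(u: list[float], v: list[float]) -> float:
    dot = sum(x * y for x, y in zip(u, v))
    return dot / (sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v)))


@dataclass
class Demo:
    task: str
    embedding: list[float]    # hypothetical object/task feature vector
    start_in_object: Pose     # where interaction begins, in the object frame
    interaction: list[Pose]   # end-effector poses relative to the start pose


def retrieve(demos: list[Demo], query: list[float]) -> Demo:
    """Retrieval step: pick the stored demo most similar to the query."""
    return max(demos, key=lambda d: cosine(d.embedding, query))


def plan(demo: Demo, object_pose_now: Pose) -> tuple[Pose, list[Pose]]:
    """Alignment target and interaction waypoints in the world frame."""
    # Alignment: reach the same object-relative pose as in the demo.
    align_target = compose(object_pose_now, demo.start_in_object)
    # Interaction: replay the demo motion relative to the aligned pose.
    waypoints = [compose(align_target, rel) for rel in demo.interaction]
    return align_target, waypoints


demos = [
    Demo("open drawer", [1.0, 0.0], translation(0.0, 0.0, 0.1),
         [translation(-0.05, 0.0, 0.0)]),
    Demo("pick mug", [0.0, 1.0], translation(0.0, 0.0, 0.2),
         [translation(0.0, 0.0, -0.1)]),
]
chosen = retrieve(demos, [0.1, 0.9])
target, waypoints = plan(chosen, translation(0.5, 0.2, 0.0))
print(chosen.task, [row[3] for row in target[:3]])
```

Storing the interaction relative to the object is what lets a single demonstration be reused when the object appears at a new pose: only the alignment target changes, and the replayed motion follows it.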
