LodeStar: Long-horizon Dexterity via Synthetic Data Augmentation from Human Demonstrations

📅 2025-08-24

📈 Citations: 0

✨ Influential: 0

career value

204K/year

🤖 AI Summary

This paper addresses three key challenges in long-horizon, dexterous robotic manipulation: poor physical robustness, difficulty in skill composition and sequencing, and high cost of real-world data acquisition. To this end, we propose a multi-skill synthesis and orchestration framework grounded in few-shot human demonstrations. Our core contributions are: (1) automatic decomposition of demonstrations into semantically meaningful skill units using foundation models; (2) a skill-routing Transformer architecture that enables dynamic skill selection and end-to-end long-horizon policy generation; and (3) a unified learning pipeline integrating imitation learning, reinforcement learning, synthetic data augmentation, and sim-to-real transfer—requiring only minimal real-world demonstrations to generate diverse, high-quality training data. Evaluated on three real-world long-horizon manipulation tasks, our method achieves substantial improvements in task success rate (+28.6%) and robustness under environmental disturbances, consistently outperforming state-of-the-art baselines.

Technology Category

Application Category

📝 Abstract

Developing robotic systems capable of robustly executing long-horizon manipulation tasks with human-level dexterity is challenging, as such tasks require both physical dexterity and seamless sequencing of manipulation skills while robustly handling environment variations. While imitation learning offers a promising approach, acquiring comprehensive datasets is resource-intensive. In this work, we propose a learning framework and system LodeStar that automatically decomposes task demonstrations into semantically meaningful skills using off-the-shelf foundation models, and generates diverse synthetic demonstration datasets from a few human demos through reinforcement learning. These sim-augmented datasets enable robust skill training, with a Skill Routing Transformer (SRT) policy effectively chaining the learned skills together to execute complex long-horizon manipulation tasks. Experimental evaluations on three challenging real-world long-horizon dexterous manipulation tasks demonstrate that our approach significantly improves task performance and robustness compared to previous baselines. Videos are available at lodestar-robot.github.io.

Problem

Research questions and friction points this paper is trying to address.

Robust execution of long-horizon dexterous manipulation tasks

Resource-intensive data acquisition for imitation learning

Handling environmental variations in complex robotic manipulation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Decomposes demonstrations into skills

Generates synthetic datasets via reinforcement learning

Uses Skill Routing Transformer for chaining

🔎 Similar Papers

No similar papers found.

Boston Dynamics

The base pay range for this position is between $155,000 to $220,000 annually. Base pay will depend on multiple individualized factors including, but not limited to internal equity, job related knowledge, skills and experience. This range represents a good faith estimate of compensation at the time of posting. Boston Dynamics offers a generous Benefits package including medical, dental vision, 401(k), paid time off and a annual bonus structure. Additional details regarding these benefit plans will be provided if an employee receives an offer for employment.

Waltham, MA

Research Scientist Intern, Robotic Control Policy (PhD)