Abstracting Robot Manipulation Skills via Mixture-of-Experts Diffusion Policies

๐Ÿ“… 2026-01-29
๐Ÿ“ˆ Citations: 1
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the scalability limitations of existing diffusion-based policies in multitask robotic manipulation, which suffer from large model sizes and high demonstration data costs. To overcome these challenges, the authors propose Skill Mixture-of-Experts Policy (SMP), a novel framework that learns a compact set of orthogonal skill bases and dynamically activates only a few task-relevant experts at each action step, enabling efficient inference and strong generalization. SMP introduces a variational objectiveโ€“based mixture-of-experts diffusion architecture, augmented with sticky routing and adaptive expert activation mechanisms to significantly reduce computational overhead while enhancing cross-task transferability. Experiments demonstrate that SMP consistently outperforms large diffusion baselines on both simulated and real-world dual-arm platforms, achieving higher task success rates with substantially lower inference costs.

Technology Category

Application Category

๐Ÿ“ Abstract
Diffusion-based policies have recently shown strong results in robot manipulation, but their extension to multi-task scenarios is hindered by the high cost of scaling model size and demonstrations. We introduce Skill Mixture-of-Experts Policy (SMP), a diffusion-based mixture-of-experts policy that learns a compact orthogonal skill basis and uses sticky routing to compose actions from a small, task-relevant subset of experts at each step. A variational training objective supports this design, and adaptive expert activation at inference yields fast sampling without oversized backbones. We validate SMP in simulation and on a real dual-arm platform with multi-task learning and transfer learning tasks, where SMP achieves higher success rates and markedly lower inference cost than large diffusion baselines. These results indicate a practical path toward scalable, transferable multi-task manipulation: learn reusable skills once, activate only what is needed, and adapt quickly when tasks change.
Problem

Research questions and friction points this paper is trying to address.

robot manipulation
multi-task learning
diffusion policies
scalability
transfer learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mixture-of-Experts
Diffusion Policy
Skill Abstraction
Multi-task Robot Manipulation
Efficient Inference
๐Ÿ”Ž Similar Papers
No similar papers found.