RoboRouter: Training-Free Policy Routing for Robotic Manipulation

📅 2026-03-09

📈 Citations: 0

✨ Influential: 0

career value

197K/year

🤖 AI Summary

While existing robotic manipulation policies demonstrate effectiveness on specific tasks, their generalization capabilities remain limited. To address this challenge, this work proposes the first training-free policy routing framework that dynamically selects the most suitable policy from a heterogeneous policy pool by leveraging semantic task representations and retrieving relevant historical tasks, thereby enabling zero-shot, lightweight policy integration. The approach incorporates a structured feedback mechanism that effectively fuses experiences from multiple policy sources. Evaluated in both simulation and real-world environments, the method achieves average success rate improvements exceeding 3% and 13%, respectively, significantly enhancing system generalization while maintaining computational efficiency and fast execution.

Technology Category

Application Category

📝 Abstract

Research on robotic manipulation has developed a diverse set of policy paradigms, including vision-language-action (VLA) models, vision-action (VA) policies, and code-based compositional approaches. Concrete policies typically attain high success rates on specific task distributions but lim-ited generalization beyond it. Rather than proposing an other monolithic policy, we propose to leverage the complementary strengths of existing approaches through intelligent policy routing. We introduce RoboRouter, a training-free framework that maintains a pool of heterogeneous policies and learns to select the best-performing policy for each task through accumulated execution experience. Given a new task, RoboRouter constructs a semantic task representation, retrieves historical records of similar tasks, predicts the optimal policy choice without requiring trial-and-error, and incorporates structured feedback to refine subsequent routing decisions. Integrating a new policy into the system requires only lightweight evaluation and incurs no training overhead. Across simulation benchmark and real-world evaluations, RoboRouter consistently outperforms than in-dividual policies, improving average success rate by more than 3% in simulation and over 13% in real-world settings, while preserving execution efficiency. Our results demonstrate that intelligent routing across heterogeneous, off-the-shelf policies provides a practical and scalable pathway toward building more capable robotic systems.

Problem

Research questions and friction points this paper is trying to address.

policy routing

robotic manipulation

heterogeneous policies

generalization

training-free

Innovation

Methods, ideas, or system contributions that make the work stand out.

training-free

policy routing

robotic manipulation

heterogeneous policies

semantic task representation

🔎 Similar Papers

No similar papers found.

Toyota Research Institute

Los Altos, CA / Cambridge, MA

Research Scientist Intern, Robotic Control Policy (PhD)