From Simple to Complex Skills: The Case of In-Hand Object Reorientation

📅 2025-01-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Dexterous hand reorientation of symmetric or textureless objects in real-world environments suffers from heavy reliance on manual hyperparameter tuning, inaccurate pose estimation, and poor sim-to-real transfer robustness. Method: We propose a skill-driven hierarchical reinforcement learning framework: a high-level policy dynamically composes pre-trained low-level rotational skills based on proprioceptive feedback and control errors; a low-level module introduces a recursive pose estimator leveraging only joint encoder readings and execution error, enabling continuous pose tracking for symmetric and textureless objects. Contribution/Results: Our approach requires no modification to reward functions, task specifications, or system configurations, significantly reducing human intervention during sim-to-real transfer. Experiments demonstrate zero-shot, highly robust object reorientation across diverse complex objects, with markedly improved out-of-distribution disturbance robustness compared to end-to-end methods.

Technology Category

Application Category

📝 Abstract
Learning policies in simulation and transferring them to the real world has become a promising approach in dexterous manipulation. However, bridging the sim-to-real gap for each new task requires substantial human effort, such as careful reward engineering, hyperparameter tuning, and system identification. In this work, we present a system that leverages low-level skills to address these challenges for more complex tasks. Specifically, we introduce a hierarchical policy for in-hand object reorientation based on previously acquired rotation skills. This hierarchical policy learns to select which low-level skill to execute based on feedback from both the environment and the low-level skill policies themselves. Compared to learning from scratch, the hierarchical policy is more robust to out-of-distribution changes and transfers easily from simulation to real-world environments. Additionally, we propose a generalizable object pose estimator that uses proprioceptive information, low-level skill predictions, and control errors as inputs to estimate the object pose over time. We demonstrate that our system can reorient objects, including symmetrical and textureless ones, to a desired pose.
Problem

Research questions and friction points this paper is trying to address.

Adaptive Learning
Real-world Object Orientation
Symmetric/Textureless Objects
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Skill System
Intelligent Object Position Predictor
Stable Transfer from Virtual to Reality
🔎 Similar Papers
No similar papers found.
Haozhi Qi
Haozhi Qi
UC Berkeley
RoboticsDeep LearningComputer Vision
Brent Yi
Brent Yi
University of California, Berkeley
M
Mike Lambeta
FAIR at Meta
Y
Yi Ma
UC Berkeley
R
Roberto Calandra
TU Dresden, Centre for Tactile Internet with Human-in-the-Loop
J
Jitendra Malik
UC Berkeley, FAIR at Meta