Multi-Goal Dexterous Hand Manipulation using Probabilistic Model-based Reinforcement Learning

📅 2025-04-30

📈 Citations: 0

✨ Influential: 0

career value

207K/year

🤖 AI Summary

Addressing the challenges of high-dimensional dynamic modeling and millisecond-level latency sensitivity in dexterous multi-objective manipulation, this paper proposes a Goal-Conditioned Probabilistic Model Predictive Control (GC-PMPC) framework. GC-PMPC integrates an asynchronous MPC architecture with an ensemble of probabilistic neural networks to explicitly model system uncertainty, while enabling joint optimization of tactile feedback and goal-conditioned policies. Evaluated on Shadow Hand simulation and the real-world DexHand 021 platform (12-DoF + 5 tactile channels), GC-PMPC achieves high-precision, real-time, adaptive manipulation of a cube toward arbitrary target poses—requiring only 80 minutes of training. It significantly outperforms state-of-the-art RL-based and deterministic MPC methods. The core contribution lies in the first deep integration of goal conditioning, probabilistic modeling, and asynchronous MPC—thereby simultaneously ensuring robustness, generalization across unseen goals, and sub-10-ms closed-loop response times.

Technology Category

Application Category

📝 Abstract

This paper tackles the challenge of learning multi-goal dexterous hand manipulation tasks using model-based Reinforcement Learning. We propose Goal-Conditioned Probabilistic Model Predictive Control (GC-PMPC) by designing probabilistic neural network ensembles to describe the high-dimensional dexterous hand dynamics and introducing an asynchronous MPC policy to meet the control frequency requirements in real-world dexterous hand systems. Extensive evaluations on four simulated Shadow Hand manipulation scenarios with randomly generated goals demonstrate GC-PMPC's superior performance over state-of-the-art baselines. It successfully drives a cable-driven Dexterous hand, DexHand 021 with 12 Active DOFs and 5 tactile sensors, to learn manipulating a cubic die to three goal poses within approximately 80 minutes of interactions, demonstrating exceptional learning efficiency and control performance on a cost-effective dexterous hand platform.

Problem

Research questions and friction points this paper is trying to address.

Learning multi-goal dexterous hand manipulation tasks

High-dimensional dynamics modeling with probabilistic neural networks

Real-time control for cost-effective dexterous hand systems

Innovation

Methods, ideas, or system contributions that make the work stand out.

Goal-Conditioned Probabilistic Model Predictive Control

Probabilistic neural network ensembles for dynamics

Asynchronous MPC policy for real-time control

🔎 Similar Papers

Omnigrasp: Grasping Diverse Objects with Simulated Humanoids

2024-07-16Neural Information Processing SystemsCitations: 16