Learning Dexterous Manipulation with Quantized Hand State

📅 2025-09-22

📈 Citations: 0

✨ Influential: 0

career value

170K/year

🤖 AI Summary

In dexterous robotic manipulation, high-dimensional hand actions dominate the coupled action space, degrading arm control performance. To address this, we propose a state-quantization-driven arm-hand decoupling learning method. Our core contribution is a discrete quantization encoding of hand states combined with continuous relaxation, which preserves fine-grained hand manipulation capability while significantly reducing action representation dimensionality. This enables construction of a compact joint action space tailored for diffusion-based policies, facilitating synergistic modeling and decoupled optimization of hand and arm actions. Within a data-driven framework, the method enhances policy learning stability and convergence efficiency. Experiments on complex dexterous manipulation tasks demonstrate superior generalization, balanced arm-hand control, and higher task success rates compared to baseline approaches.

Technology Category

Application Category

📝 Abstract

Dexterous robotic hands enable robots to perform complex manipulations that require fine-grained control and adaptability. Achieving such manipulation is challenging because the high degrees of freedom tightly couple hand and arm motions, making learning and control difficult. Successful dexterous manipulation relies not only on precise hand motions, but also on accurate spatial positioning of the arm and coordinated arm-hand dynamics. However, most existing visuomotor policies represent arm and hand actions in a single combined space, which often causes high-dimensional hand actions to dominate the coupled action space and compromise arm control. To address this, we propose DQ-RISE, which quantizes hand states to simplify hand motion prediction while preserving essential patterns, and applies a continuous relaxation that allows arm actions to diffuse jointly with these compact hand states. This design enables the policy to learn arm-hand coordination from data while preventing hand actions from overwhelming the action space. Experiments show that DQ-RISE achieves more balanced and efficient learning, paving the way toward structured and generalizable dexterous manipulation. Project website: http://rise-policy.github.io/DQ-RISE/

Problem

Research questions and friction points this paper is trying to address.

High-dimensional hand actions dominate coupled action space in dexterous manipulation

Existing visuomotor policies compromise arm control due to combined action representation

Learning arm-hand coordination is difficult with tightly coupled high-DOF motions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantizes hand states to simplify motion prediction

Applies continuous relaxation for joint arm-hand diffusion

Prevents hand actions from overwhelming the action space

🔎 Similar Papers

No similar papers found.