Dexterous Manipulation Transfer via Progressive Kinematic-Dynamic Alignment

๐Ÿ“… 2025-11-14
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Data collection for dexterous manipulation with multi-fingered robotic hands is challenging and poorly scalable, resulting in a severe scarcity of high-quality training data and hindering data-driven policy learning. To address this, we propose a hand-agnostic, progressive kinematics-dynamics alignment framework that synthesizes semantically consistent and motion-smooth robotic dexterous manipulation trajectories directly from monocular human demonstration videosโ€”without requiring any real-world robot interaction data. Our method integrates kinematic matching, thumb-guided initialization, action-space remapping, residual policy learning under a unified reward, and wrist-trajectory optimization to enhance inter-finger coordination. Evaluated across diverse hand morphologies, objects, and manipulation tasks, the framework enables efficient cross-domain policy transfer, achieving an average success rate of 73%. It significantly improves data efficiency and generalization capability over prior approaches.

Technology Category

Application Category

๐Ÿ“ Abstract
The inherent difficulty and limited scalability of collecting manipulation data using multi-fingered robot hand hardware platforms have resulted in severe data scarcity, impeding research on data-driven dexterous manipulation policy learning. To address this challenge, we present a hand-agnostic manipulation transfer system. It efficiently converts human hand manipulation sequences from demonstration videos into high-quality dexterous manipulation trajectories without requirements of massive training data. To tackle the multi-dimensional disparities between human hands and dexterous hands, as well as the challenges posed by high-degree-of-freedom coordinated control of dexterous hands, we design a progressive transfer framework: first, we establish primary control signals for dexterous hands based on kinematic matching; subsequently, we train residual policies with action space rescaling and thumb-guided initialization to dynamically optimize contact interactions under unified rewards; finally, we compute wrist control trajectories with the objective of preserving operational semantics. Using only human hand manipulation videos, our system automatically configures system parameters for different tasks, balancing kinematic matching and dynamic optimization across dexterous hands, object categories, and tasks. Extensive experimental results demonstrate that our framework can automatically generate smooth and semantically correct dexterous hand manipulation that faithfully reproduces human intentions, achieving high efficiency and strong generalizability with an average transfer success rate of 73%, providing an easily implementable and scalable method for collecting robot dexterous manipulation data.
Problem

Research questions and friction points this paper is trying to address.

Addresses data scarcity in dexterous manipulation policy learning
Transfers human hand manipulation to robot hands despite kinematic differences
Solves high-DOF coordinated control challenges for dexterous robotic hands
Innovation

Methods, ideas, or system contributions that make the work stand out.

Progressive kinematic-dynamic alignment transfer framework
Residual policies optimize contact with action rescaling
Wrist trajectories preserve operational semantics automatically
๐Ÿ”Ž Similar Papers
No similar papers found.
W
Wenbin Bai
School of Information and Communication Engineering, Dalian University of Technology, China
Qiyu Chen
Qiyu Chen
Institute of Automation, Chinese Academy of Sciences
Anomaly DetectionComputer VisionDeep Learning
X
Xiangbo Lin
School of Information and Communication Engineering, Dalian University of Technology, China
J
Jianwen Li
Shenyang Institute of Automation, Chinese Academy of Sciences, China
Q
Quancheng Li
School of Information and Communication Engineering, Dalian University of Technology, China
H
Hejiang Pan
School of Information and Communication Engineering, Dalian University of Technology, China
Y
Yi Sun
School of Information and Communication Engineering, Dalian University of Technology, China