DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming

📅 2026-03-23

📈 Citations: 0

✨ Influential: 0

career value

230K/year

🤖 AI Summary

Dexterous robotic manipulation remains challenging due to the need to integrate in-hand control, high-frequency contact interactions, and long-horizon coordination. This work proposes a hierarchical, object-centric bimanual strategy using drumming as a comprehensive benchmark task for the first time. The approach combines trajectory planning with residual reinforcement learning and introduces a reward function that explicitly models interactions between fingers and drumsticks as well as between drumsticks and drum surfaces. An efficient simulation training pipeline coupled with a sim-to-real transfer framework enables effective deployment on physical hardware. Experiments demonstrate successful execution of diverse drumming sequences and advanced techniques in simulation, achieving 1.87× and 1.22× improvements in F1 scores for easy and difficult pieces, respectively. On the real robot, the method accomplishes full musical performances with an F1 score of 1.0.

Technology Category

Application Category

📝 Abstract

Performing in-hand, contact-rich, and long-horizon dexterous manipulation remains an unsolved challenge in robotics. Prior hand dexterity works have considered each of these three challenges in isolation, yet do not combine these skills into a single, complex task. To further test the capabilities of dexterity, we propose drumming as a testbed for dexterous manipulation. Drumming naturally integrates all three challenges: it involves in-hand control for stabilizing and adjusting the drumstick with the fingers, contact-rich interaction through repeated striking of the drum surface, and long-horizon coordination when switching between drums and sustaining rhythmic play. We present DexDrummer, a hierarchical object-centric bimanual drumming policy trained in simulation with sim-to-real transfer. The framework reduces the exploration difficulty of pure reinforcement learning by combining trajectory planning with residual RL corrections for fast transitions between drums. A dexterous manipulation policy handles contact-rich dynamics, guided by rewards that explicitly model both finger-stick and stick-drum interactions. In simulation, we show our policy can play two styles of music: multi-drum, bimanual songs and challenging, technical exercises that require increased dexterity. Across simulated bimanual tasks, our dexterous, reactive policy outperforms a fixed grasp policy by 1.87x across easy songs and 1.22x across hard songs F1 scores. In real-world tasks, we show song performance across a multi-drum setup. DexDrummer is able to play our training song and its extended version with an F1 score of 1.0.

Problem

Research questions and friction points this paper is trying to address.

dexterous manipulation

in-hand control

contact-rich interaction

long-horizon coordination

robot drumming

Innovation

Methods, ideas, or system contributions that make the work stand out.

dexterous manipulation

contact-rich interaction

long-horizon control

sim-to-real transfer

hierarchical policy

🔎 Similar Papers

No similar papers found.