HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies

📅 2026-03-12

📈 Citations: 0

✨ Influential: 0

career value

228K/year

🤖 AI Summary

This work addresses the challenge of controlling dexterous multi-fingered hands in millimeter-precision tasks—such as bimanual piano playing—where control often fails due to the scarcity of high-quality real-world data and the difficulty of sim-to-real transfer. To overcome this, the authors propose HandelBot, a two-stage framework that first performs structured joint-space alignment through physical experiments and then refines fine motor skills via residual reinforcement learning. Using only 30 minutes of real-world interaction data, HandelBot achieves high-fidelity performance across five standard piano pieces, outperforming direct simulation-to-reality deployment by a factor of 1.8. The key innovation lies in the integration of structured fine-tuning with residual learning, which substantially enhances the real-world adaptability of simulation-pretrained policies in precision manipulation tasks.

Technology Category

Application Category

📝 Abstract

Mastering dexterous manipulation with multi-fingered hands has been a grand challenge in robotics for decades. Despite its potential, the difficulty of collecting high-quality data remains a primary bottleneck for high-precision tasks. While reinforcement learning and simulation-to-real-world transfer offer a promising alternative, the transferred policies often fail for tasks demanding millimeter-scale precision, such as bimanual piano playing. In this work, we introduce HandelBot, a framework that combines a simulation policy and rapid adaptation through a two-stage pipeline. Starting from a simulation-trained policy, we first apply a structured refinement stage to correct spatial alignments by adjusting lateral finger joints based on physical rollouts. Next, we use residual reinforcement learning to autonomously learn fine-grained corrective actions. Through extensive hardware experiments across five recognized songs, we demonstrate that HandelBot can successfully perform precise bimanual piano playing. Our system outperforms direct simulation deployment by a factor of 1.8x and requires only 30 minutes of physical interaction data.

Problem

Research questions and friction points this paper is trying to address.

dexterous manipulation

simulation-to-real transfer

bimanual piano playing

high-precision robotics

multi-fingered hands

Innovation

Methods, ideas, or system contributions that make the work stand out.

dexterous manipulation

simulation-to-real transfer

residual reinforcement learning