Self-supervised perception for tactile skin covered dexterous hands

📅 2025-05-16

📈 Citations: 0

✨ Influential: 0

career value

223K/year

🤖 AI Summary

Current dexterous robotic hands suffer from incomplete tactile sensing across the entire hand (fingertips, phalanges, palm), poor interpretability of magnetic skin sensors, cross-device calibration difficulties, and heavy reliance on labeled data. To address these challenges, this paper introduces Sparsh-skin: the first self-supervised pre-trained encoder specifically designed for full-hand magnetic tactile skins. Its core innovation lies in the first-ever self-supervised, self-distillation pre-training framework for magnetic tactile sequences, jointly modeling kinematic and magnetic flux temporal dynamics—thereby overcoming the inherent low physical interpretability of magnetic signals and hardware heterogeneity constraints. The model enables zero-shot transfer to downstream tasks including state estimation and policy learning. In benchmark evaluations, it achieves over 41% improvement in sample efficiency and outperforms existing methods by more than 56% in performance.

Technology Category

Application Category

📝 Abstract

We present Sparsh-skin, a pre-trained encoder for magnetic skin sensors distributed across the fingertips, phalanges, and palm of a dexterous robot hand. Magnetic tactile skins offer a flexible form factor for hand-wide coverage with fast response times, in contrast to vision-based tactile sensors that are restricted to the fingertips and limited by bandwidth. Full hand tactile perception is crucial for robot dexterity. However, a lack of general-purpose models, challenges with interpreting magnetic flux and calibration have limited the adoption of these sensors. Sparsh-skin, given a history of kinematic and tactile sensing across a hand, outputs a latent tactile embedding that can be used in any downstream task. The encoder is self-supervised via self-distillation on a variety of unlabeled hand-object interactions using an Allegro hand sensorized with Xela uSkin. In experiments across several benchmark tasks, from state estimation to policy learning, we find that pretrained Sparsh-skin representations are both sample efficient in learning downstream tasks and improve task performance by over 41% compared to prior work and over 56% compared to end-to-end learning.

Problem

Research questions and friction points this paper is trying to address.

Developing self-supervised tactile perception for dexterous robot hands

Overcoming limitations of magnetic tactile skin interpretation and calibration

Enhancing robot dexterity with full-hand tactile sensing and pre-trained models

Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-supervised encoder for magnetic tactile skin

Full-hand coverage with fast response sensors

Pretrained latent embeddings boost task performance

🔎 Similar Papers

Canonical Representation and Force-Based Pretraining of 3D Tactile for Dexterous Visuo-Tactile Policy Learning