Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

📅 2025-06-20

📈 Citations: 0

✨ Influential: 0

career value

193K/year

🤖 AI Summary

To address the inefficiency of DAgger data collection and difficulty in policy updates for contact-intensive manipulation tasks, this paper proposes a human-robot collaborative framework with flexible intervention. It introduces a lightweight, in-execution human action correction interface, integrates force-sensing closed-loop control with compliance regulation, and develops a residual-policy learning mechanism for adaptive policy refinement. Crucially, it incorporates real-time force feedback into residual policy learning to enable online, non-interruptive human corrective interventions. The DAgger pipeline is further optimized for both data acquisition and policy iteration. Evaluated on a physical robot platform, the approach improves base-policy success rates by over 50% on book-flipping and belt-assembly tasks—substantially outperforming both from-scratch training and fine-tuning baselines—while requiring only minimal human correction samples.

Technology Category

Application Category

📝 Abstract

We address key challenges in Dataset Aggregation (DAgger) for real-world contact-rich manipulation: how to collect informative human correction data and how to effectively update policies with this new data. We introduce Compliant Residual DAgger (CR-DAgger), which contains two novel components: 1) a Compliant Intervention Interface that leverages compliance control, allowing humans to provide gentle, accurate delta action corrections without interrupting the ongoing robot policy execution; and 2) a Compliant Residual Policy formulation that learns from human corrections while incorporating force feedback and force control. Our system significantly enhances performance on precise contact-rich manipulation tasks using minimal correction data, improving base policy success rates by over 50% on two challenging tasks (book flipping and belt assembly) while outperforming both retraining-from-scratch and finetuning approaches. Through extensive real-world experiments, we provide practical guidance for implementing effective DAgger in real-world robot learning tasks. Result videos are available at: https://compliant-residual-dagger.github.io/

Problem

Research questions and friction points this paper is trying to address.

Improving real-world contact-rich manipulation with human corrections

Collecting informative human correction data for DAgger

Updating policies effectively with new human correction data

Innovation

Methods, ideas, or system contributions that make the work stand out.

Compliant Intervention Interface for gentle corrections

Compliant Residual Policy with force feedback

Enhances manipulation tasks using minimal correction data

🔎 Similar Papers

MimicTouch: Leveraging Multi-modal Human Tactile Demonstrations for Contact-rich Manipulation

2023-10-25Citations: 1

Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR

2024-09-18arXiv.orgCitations: 2