🤖 AI Summary
To address the inefficiency of DAgger data collection and difficulty in policy updates for contact-intensive manipulation tasks, this paper proposes a human-robot collaborative framework with flexible intervention. It introduces a lightweight, in-execution human action correction interface, integrates force-sensing closed-loop control with compliance regulation, and develops a residual-policy learning mechanism for adaptive policy refinement. Crucially, it incorporates real-time force feedback into residual policy learning to enable online, non-interruptive human corrective interventions. The DAgger pipeline is further optimized for both data acquisition and policy iteration. Evaluated on a physical robot platform, the approach improves base-policy success rates by over 50% on book-flipping and belt-assembly tasks—substantially outperforming both from-scratch training and fine-tuning baselines—while requiring only minimal human correction samples.
📝 Abstract
We address key challenges in Dataset Aggregation (DAgger) for real-world contact-rich manipulation: how to collect informative human correction data and how to effectively update policies with this new data. We introduce Compliant Residual DAgger (CR-DAgger), which contains two novel components: 1) a Compliant Intervention Interface that leverages compliance control, allowing humans to provide gentle, accurate delta action corrections without interrupting the ongoing robot policy execution; and 2) a Compliant Residual Policy formulation that learns from human corrections while incorporating force feedback and force control. Our system significantly enhances performance on precise contact-rich manipulation tasks using minimal correction data, improving base policy success rates by over 50% on two challenging tasks (book flipping and belt assembly) while outperforming both retraining-from-scratch and finetuning approaches. Through extensive real-world experiments, we provide practical guidance for implementing effective DAgger in real-world robot learning tasks. Result videos are available at: https://compliant-residual-dagger.github.io/