Bi-HIL: Bilateral Control-Based Multimodal Hierarchical Imitation Learning via Subtask-Level Progress Rate and Keyframe Memory for Long-Horizon Contact-Rich Robotic Manipulation

๐Ÿ“… 2026-03-04
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenges of long-horizon, contact-intensive robotic manipulation, where partial observability and contact uncertainty often lead to unstable subtask transitions and poor coordination. To this end, the authors propose a bilateral controlโ€“based multimodal hierarchical imitation learning framework that, for the first time, integrates subtask progress modeling with a keyframe memory mechanism to dynamically condition both high- and low-level policies. This integration enhances contact-awareness and long-term task coordination. Experimental results demonstrate that the proposed approach significantly outperforms flat policies and ablated variants on both single-arm and dual-arm real-world robotic tasks, exhibiting superior robustness and effectiveness in complex contact-rich scenarios.
๐Ÿ“ Abstract
Long-horizon contact-rich robotic manipulation remains challenging due to partial observability and unstable subtask transitions under contact uncertainty. While hierarchical architectures improve temporal reasoning and bilateral imitation learning enables force-aware control, existing approaches often rely on flat policies that struggle with long-horizon coordination. We propose Bi-HIL, a bilateral control-based multimodal hierarchical imitation learning framework for long-horizon manipulation. Bi-HIL stabilizes hierarchical coordination by integrating keyframe memory with subtask-level progress rate that models phase progression within the active subtask and conditions both high- and low-level policies. We evaluate Bi-HIL on unimanual and bimanual real-robot tasks, demonstrating consistent improvements over flat and ablated variants. The results highlight the importance of explicitly modeling subtask progression together with force-aware control for robust long-horizon manipulation. For additional material, please check: https://mertcookimg.github.io/bi-hil
Problem

Research questions and friction points this paper is trying to address.

long-horizon manipulation
contact-rich
partial observability
subtask transition
hierarchical coordination
Innovation

Methods, ideas, or system contributions that make the work stand out.

bilateral control
hierarchical imitation learning
subtask-level progress rate
keyframe memory
contact-rich manipulation
๐Ÿ”Ž Similar Papers
No similar papers found.
T
Thanpimon Buamanee
The University of Osaka
M
Masato Kobayashi
The University of Osaka, Kobe University
Yuki Uranishi
Yuki Uranishi
The University of Osaka
Computer VisionXRHuman Computer Interaction