π€ AI Summary
Addressing the challenge of tightly coupling base navigation, manipulator control, and visual perception in mobile manipulation robots, this paper proposes an embodied multimodal closed-loop control framework. Methodologically, it introduces three key innovations: (1) a differentiable embodied function Ο(w) that dynamically balances prioritization between mobility and manipulation tasks; (2) a monitor-pose-guided servoing method (MPBS) ensuring robust end-to-end target tracking; and (3) a unified quadratic programming (QP)-based optimization model integrating kinematic constraints, visual feedback, and task-specific requirements. Experimental evaluation in real-world scenarios demonstrates significant performance improvements: task success rate increases to 95.6%, and average execution time decreases by 52.8% compared to conventional hierarchical control approaches. These results validate the frameworkβs effectiveness in enabling coordinated, responsive, and constraint-aware mobile manipulation.
π Abstract
Mobile manipulation typically entails the base for mobility, the arm for accurate manipulation, and the camera for perception. It is necessary to follow the principle of Distant Mobility, Close Grasping(DMCG) in holistic control. We propose Embodied Holistic Control for Mobile Manipulation(EHC-MM) with the embodied function of sig(w): By formulating the DMCG principle as a Quadratic Programming (QP) problem, sig(w) dynamically balances the robot's emphasis between movement and manipulation with the consideration of the robot's state and environment. In addition, we propose the Monitor-Position-Based Servoing (MPBS) with sig(w), enabling the tracking of the target during the operation. This approach allows coordinated control between the robot's base, arm, and camera. Through extensive simulations and real-world experiments, our approach significantly improves both the success rate and efficiency of mobile manipulation tasks, achieving a 95.6% success rate in the real-world scenarios and a 52.8% increase in time efficiency.