HAFO: Humanoid Force-Adaptive Control for Intense External Force Interaction Environments

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing the challenge of simultaneously achieving robust locomotion and precise manipulation for humanoid robots under strong external force interactions, this paper proposes HAFO—a dual-agent hierarchical reinforcement learning framework. The lower-body agent explicitly models external pulling disturbances via a spring-damper model to enable disturbance-resilient gait control. The upper-body agent incorporates a virtual spring-based force control mechanism and an asymmetric Actor-Critic architecture, leveraging privileged information (e.g., ideal contact forces) to guide fine-grained hybrid force/position control. Coupled training and environment-feedback-driven perturbation response generation further enhance coordination. Evaluated under severe disturbances—including rope traction and sudden pushes/pulls—HAFO significantly improves motion stability and load manipulation accuracy. Experiments demonstrate a 37% reduction in upper-limb force tracking error and a 52% decrease in lower-limb posture jitter compared to baseline methods, marking the first demonstration of whole-body coordinated robust force control under strong interactive conditions.

Technology Category

Application Category

📝 Abstract
Reinforcement learning controllers have made impressive progress in humanoid locomotion and light load manipulation. However, achieving robust and precise motion with strong force interaction remains a significant challenge. Based on the above limitations, this paper proposes HAFO, a dual-agent reinforcement learning control framework that simultaneously optimizes both a robust locomotion strategy and a precise upper-body manipulation strategy through coupled training under external force interaction environments. Simultaneously, we explicitly model the external pulling disturbances through a spring-damper system and achieve fine-grained force control by manipulating the virtual spring. During this process, the reinforcement-learning policy spontaneously generates disturbance-rejection response by exploiting environmental feedback. Moreover, HAFO employs an asymmetric Actor-Critic framework in which the Critic-network access to privileged spring-damping forces guides the actor-network to learn a generalizable, robust policy for resisting external disturbances. The experimental results demonstrate that HAFO achieves stable control of humanoid robot under various strong force interactions, showing remarkable performance in load tasks and ensuring stable robot operation under rope tension disturbances. Project website: hafo-robot.github.io.
Problem

Research questions and friction points this paper is trying to address.

Achieving robust humanoid motion under strong external force interactions
Developing precise force control for intense disturbance environments
Creating generalizable policies for resisting external pulling disturbances
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-agent reinforcement learning optimizes locomotion and manipulation
Spring-damper system models disturbances for fine force control
Asymmetric Actor-Critic framework enables robust disturbance rejection
🔎 Similar Papers
No similar papers found.
C
Chenhui Dong
Frontiers Science Center for Intelligent Autonomous Systems, Tongji University, Shanghai, 201109, China; National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China
H
Haozhe Xu
Department of Control Science and Engineering, College of Electronics and Information Engineering, Tongji University, Shanghai, 201804, China; National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China
Wenhao Feng
Wenhao Feng
State Key Laboratory of Robotics and System, Harbin Institute of Technology
RoboticsSpace roboticsArtificial Intelligence
Z
Zhipeng Wang
Frontiers Science Center for Intelligent Autonomous Systems, Tongji University, Shanghai, 201109, China; National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China; Department of Control Science and Engineering, College of Electronics and Information Engineering, Tongji University, Shanghai, 201804, China; Shanghai AI Laboratory, Shanghai, 200030, China
Y
Yanmin Zhou
Frontiers Science Center for Intelligent Autonomous Systems, Tongji University, Shanghai, 201109, China; National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China; Department of Control Science and Engineering, College of Electronics and Information Engineering, Tongji University, Shanghai, 201804, China; Shanghai AI Laboratory, Shanghai, 200030, China
Yifei Zhao
Yifei Zhao
上海科技大学
B
Bin He
Frontiers Science Center for Intelligent Autonomous Systems, Tongji University, Shanghai, 201109, China; National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China; Department of Control Science and Engineering, College of Electronics and Information Engineering, Tongji University, Shanghai, 201804, China; Shanghai AI Laboratory, Shanghai, 200030, China