🤖 AI Summary
Model mismatch severely degrades controller robustness under parametric uncertainties and environmental disturbances. Method: This paper proposes a modular residual learning framework that synergistically integrates model-based components—dynamic modeling, model-based gait planning, and model predictive control—with data-driven learning. Crucially, lightweight residual networks are embedded at key stages of gait planning and dynamics modeling, with learning strategies customized to each component’s characteristics to improve training efficiency and generalization. These residual modules are end-to-end optimized via deep reinforcement learning to achieve high-fidelity compensation in simulation. Contribution/Results: The resulting hybrid controller significantly enhances the nominal controller’s robustness against parameter variations and external disturbances. Experiments on a real quadrupedal robot demonstrate stable locomotion and precise velocity tracking. Notably, the method maintains balance and reliably executes motion commands even under strong unmodeled uncertainties—such as unknown terrain and actuator delays—beyond the scope of the simulation model.
📝 Abstract
This paper presents a novel approach that combines the advantages of both model-based and learning-based frameworks to achieve robust locomotion. The residual modules are integrated with each corresponding part of the model-based framework, a footstep planner and dynamic model designed using heuristics, to complement performance degradation caused by a model mismatch. By utilizing a modular structure and selecting the appropriate learning-based method for each residual module, our framework demonstrates improved control performance in environments with high uncertainty, while also achieving higher learning efficiency compared to baseline methods. Moreover, we observed that our proposed methodology not only enhances control performance but also provides additional benefits, such as making nominal controllers more robust to parameter tuning. To investigate the feasibility of our framework, we demonstrated residual modules combined with model predictive control in a real quadrupedal robot. Despite uncertainties beyond the simulation, the robot successfully maintains balance and tracks the commanded velocity.