CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots

📅 2026-03-03
🤖 AI Summary
This work addresses robust autonomous locomotion for humanoid robots on complex terrain. Existing Mixture-of-Experts (MoE) policies specialize poorly because the gating network activates experts almost uniformly regardless of terrain. The authors propose CMoE, a single-stage reinforcement learning framework that adds contrastive learning to the MoE policy: expert activations are made consistent within the same terrain type and dissimilar across different terrain types, which encourages terrain-specific expert specialization. On the Unitree G1 robot, the resulting end-to-end locomotion policy traverses continuous steps up to 20 cm high and gaps up to 80 cm wide, shows robust and natural gait on mixed terrains, and is reported to exceed the limits of existing methods.
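To make the specialization problem concrete, here is a minimal sketch of softmax gating in an MoE policy head, together with a simple uniformity measure for the gate weights. It is an illustration only, not the authors' implementation: the function names `softmax`, `moe_forward`, and `normalized_entropy` are hypothetical, and the entropy diagnostic is an assumption chosen for this example rather than a metric taken from the paper.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of gate logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(gate_logits, expert_outputs):
    """Mix per-expert action vectors using softmax gate weights.

    gate_logits: one logit per expert (hypothetical gating-network output).
    expert_outputs: one action vector per expert, all of equal length.
    """
    weights = softmax(gate_logits)
    dim = len(expert_outputs[0])
    action = [
        sum(w * out[d] for w, out in zip(weights, expert_outputs))
        for d in range(dim)
    ]
    return weights, action

def normalized_entropy(weights):
    """Return 1.0 for perfectly uniform gating and values near 0.0
    when a single expert dominates (assumed diagnostic, not from the paper)."""
    h = -sum(w * math.log(w) for w in weights if w > 0)
    return h / math.log(len(weights))

if __name__ == "__main__":
    # Nearly identical logits: every expert contributes about equally.
    flat, _ = moe_forward([0.1, 0.0, 0.05, 0.02], [[1.0], [2.0], [3.0], [4.0]])
    # One dominant logit: the gate has effectively picked a specialist.
    peaked, _ = moe_forward([6.0, 0.0, 0.0, 0.0], [[1.0], [2.0], [3.0], [4.0]])
    print(normalized_entropy(flat), normalized_entropy(peaked))
```

If the normalized entropy stays close to 1.0 on every terrain, the experts are being blended almost equally everywhere, which is the failure mode the summary describes.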

📝 Abstract
For effective deployment in real-world environments, humanoid robots must autonomously navigate a diverse range of complex terrains with abrupt transitions. While the vanilla Mixture-of-Experts (MoE) framework is theoretically capable of modeling diverse terrain features, in practice the gating network exhibits nearly uniform expert activations across different terrains, which weakens expert specialization and limits the model's expressive power. To address this limitation, we introduce CMoE, a novel single-stage reinforcement learning framework that integrates contrastive learning to refine expert activation distributions. By imposing contrastive constraints, CMoE maximizes the consistency of expert activations within the same terrain while minimizing their similarity across different terrains, thereby encouraging experts to specialize in distinct terrain types. We validate our approach on the Unitree G1 humanoid robot through a series of challenging experiments. Results demonstrate that CMoE enables the robot to traverse continuous steps up to 20 cm high and gaps up to 80 cm wide, while achieving robust and natural gait across diverse mixed terrains, surpassing the limits of existing methods. To support further research and foster community development, we release our code publicly.
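The contrastive constraint described in the abstract can be sketched as a loss over gate activation vectors labeled by terrain. The abstract does not give the exact objective, so the code below assumes a generic supervised-contrastive (InfoNCE-style) form with cosine similarity. The function name `gate_contrastive_loss`, the `temperature` value, and the toy data are all hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def gate_contrastive_loss(gates, terrain_ids, temperature=0.1):
    """Assumed supervised-contrastive loss on gate activations.

    gates: one gate-weight vector per sample (e.g. softmax outputs).
    terrain_ids: terrain label of each sample.
    Samples sharing a terrain label are pulled together, all other
    samples are pushed apart. Anchors with no positive are skipped.
    """
    n = len(gates)
    total, anchors = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n)
                     if j != i and terrain_ids[j] == terrain_ids[i]]
        if not positives:
            continue
        logits = {j: cosine(gates[i], gates[j]) / temperature
                  for j in range(n) if j != i}
        # Log-sum-exp over all other samples, shifted for stability.
        m = max(logits.values())
        log_denom = m + math.log(sum(math.exp(x - m) for x in logits.values()))
        total += -sum(logits[p] - log_denom for p in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0

if __name__ == "__main__":
    terrains = [0, 0, 1, 1]  # e.g. 0 = steps, 1 = gaps (toy labels)
    uniform = [[0.5, 0.5]] * 4
    specialized = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
    print(gate_contrastive_loss(uniform, terrains))
    print(gate_contrastive_loss(specialized, terrains))
```

Under these assumptions, identical gate vectors on every terrain give the largest loss for this toy batch, while terrain-specific gate patterns drive it toward zero. In a single-stage setup such a term would presumably be added to the reinforcement learning objective with a weighting coefficient, which this summary does not specify.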
Problem

Research questions and friction points this paper is trying to address.

humanoid robots
terrain adaptation
mixture of experts
expert specialization
motion control
Innovation

Methods, ideas, or system contributions that make the work stand out.

Contrastive Learning
Mixture of Experts
Terrain Adaptation
Humanoid Robots
Reinforcement Learning
Authors
Shihao Ma
University of Toronto, Vector Institute
Machine Learning, Computation Biology, AI in healthcare
Hongjin Chen
College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai, China, 200433
Zijun Xu
ShanghaiTech University
Storage System, HPC, Machine Learning System
Yi Zhao
College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai, China, 200433
Ke Wu
College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai, China, 200433
Ruichen Yang
Johns Hopkins University
Brain-Computer Interface
Leyao Zou
College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai, China, 200433
Zhongxue Gan
College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai, China, 200433
Wenchao Ding
Tenure-track Associate Professor, Fudan University
Robotics, Motion Planning, Autonomous Navigation, Decision Making