🤖 AI Summary
This work addresses the global swing-up and stabilization control problem for underactuated double-pendulum systems (Acrobot/Pendubot). We propose a lightweight, real-time nonlinear model predictive control (NMPC) framework grounded in exact nonlinear dynamical modeling, incorporating efficient real-time optimization and explicit constraint handling to guarantee global convergence from arbitrary initial states. Compared to conventional LQR and reinforcement learning baselines, the method significantly improves robustness against initial condition sensitivity and external disturbances: it achieves 100% swing-up success rate on the RealAIGym platform with end-to-end latency under 50 ms. The core contribution lies in an NMPC architecture that jointly ensures theoretical guarantees—such as global stability and constraint satisfaction—and engineering feasibility—namely hard real-time execution—thereby delivering a verifiable, deployable global control solution for underactuated systems.
📝 Abstract
The 3rd AI Olympics with RealAIGym competition poses the challenge of developing a global policy that can swing up and stabilize an underactuated 2-link system Acrobot and/or Pendubot from any configuration in the state space. This paper presents an optimal control-based approach using a real-time Nonlinear Model Predictive Control (MPC). The results show that the controller achieves good performance and robustness and can reliably handle disturbances.