Onboard MuJoCo-based Model Predictive Control for Shipboard Crane with Double-Pendulum Sway Suppression

📅 2026-03-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of controlling offshore cranes, where wind and wave disturbances readily excite severe double-pendulum oscillations in suspended loads. Conventional control strategies rely on oversimplified models, while reinforcement learning approaches often suffer from limited generalization. To overcome these limitations, the paper proposes a real-time model predictive control (MPC) framework leveraging MuJoCo-based physics simulation, integrated with the cross-entropy method (CEM) for sampling-based receding-horizon optimization. The approach directly evaluates action sequences within the simulator to jointly optimize target tracking and swing suppression, eliminating the need for complex analytical models or extensive offline training. Notably, it achieves efficient and robust control of the double-pendulum system—including unmodeled perturbations such as added payloads and external disturbances—on resource-constrained embedded platforms. Experimental results demonstrate significant performance improvements over traditional PID and deep reinforcement learning baselines, with full real-time deployability.

Technology Category

Application Category

📝 Abstract
Transferring heavy payloads in maritime settings relies on efficient crane operation, limited by hazardous double-pendulum payload sway. This sway motion is further exacerbated in offshore environments by external perturbations from wind and ocean waves. Manual suppression of these oscillations on an underactuated crane system by human operators is challenging. Existing control methods struggle in such settings, often relying on simplified analytical models, while deep reinforcement learning (RL) approaches tend to generalise poorly to unseen conditions. Deploying a predictive controller onto compute-constrained, highly non-linear physical systems without relying on extensive offline training or complex analytical models remains a significant challenge. Here we show a complete real-time control pipeline centered on the MuJoCo MPC framework that leverages a cross-entropy method planner to evaluate candidate action sequences directly within a physics simulator. By using simulated rollouts, this sampling-based approach successfully reconciles the conflicting objectives of dynamic target tracking and sway damping without relying on complex analytical models. We demonstrate that the controller can run effectively on a resource-constrained embedded hardware, while outperforming traditional PID and RL baselines in counteracting external base perturbations. Furthermore, our system demonstrates robustness even when subjected to unmodeled physical discrepancies like the introduction of a second payload.
Problem

Research questions and friction points this paper is trying to address.

double-pendulum sway
shipboard crane
model predictive control
external perturbations
underactuated system
Innovation

Methods, ideas, or system contributions that make the work stand out.

Model Predictive Control
MuJoCo
Double-Pendulum Sway Suppression
Cross-Entropy Method
Embedded Real-Time Control
🔎 Similar Papers
No similar papers found.
O
Oscar Pang
Adaptive and Intelligent Robotics Lab, Department of Computing, Imperial College London, Exhibition Rd, London, SW7 2BX, United Kingdom
L
Lisa Coiffard
Adaptive and Intelligent Robotics Lab, Department of Computing, Imperial College London, Exhibition Rd, London, SW7 2BX, United Kingdom
P
Paul Templier
Adaptive and Intelligent Robotics Lab, Department of Computing, Imperial College London, Exhibition Rd, London, SW7 2BX, United Kingdom
L
Luke Beddow
Adaptive and Intelligent Robotics Lab, Department of Computing, Imperial College London, Exhibition Rd, London, SW7 2BX, United Kingdom
Kamil Dreczkowski
Kamil Dreczkowski
Research Associate at Imperial College London
Robot LearningRobot ManipulationBayesian Optimisation
Antoine Cully
Antoine Cully
Professor of Machine Learning and Robotics at Imperial College London
Robot LearningMachine LearningQuality-DiversityReinforcement LearningNeuroevolution