Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Network Modeling

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of deploying nonlinear model predictive control (NMPC) on embedded platforms, where its online computational complexity is prohibitive, and existing learning-based approximation methods suffer from high training costs and heavy reliance on expert demonstration data. To overcome these limitations, the authors propose Sequential-AMPC, a novel approach employing a recurrent neural policy with shared parameters to generate candidate control sequences, thereby substantially reducing dependence on expert trajectories. The method further incorporates a safety-enhanced online evaluation and fallback mechanism to ensure closed-loop safety. Compared to feedforward baselines, Sequential-AMPC achieves faster training in high-dimensional systems, requires significantly less expert data, yields more feasible candidate sequences, and demonstrates consistently improved and stable validation performance.

Technology Category

Application Category

📝 Abstract
The practical deployment of nonlinear model predictive control (NMPC) is often limited by online computation: solving a nonlinear program at high control rates can be expensive on embedded hardware, especially when models are complex or horizons are long. Learning-based NMPC approximations shift this computation offline but typically demand large expert datasets and costly training. We propose Sequential-AMPC, a sequential neural policy that generates MPC candidate control sequences by sharing parameters across the prediction horizon. For deployment, we wrap the policy in a safety-augmented online evaluation and fallback mechanism, yielding Safe Sequential-AMPC. Compared to a naive feedforward policy baseline across several benchmarks, Sequential-AMPC requires substantially fewer expert MPC rollouts and yields candidate sequences with higher feasibility rates and improved closed-loop safety. On high-dimensional systems, it also exhibits better learning dynamics and performance in fewer epochs while maintaining stable validation improvement where the feedforward baseline can stagnate.
Problem

Research questions and friction points this paper is trying to address.

nonlinear model predictive control
online computation
learning-based control
expert dataset
embedded hardware
Innovation

Methods, ideas, or system contributions that make the work stand out.

Sequential-AMPC
safe learning-based control
recurrent neural network modeling
nonlinear model predictive control
parameter sharing across horizon
🔎 Similar Papers
No similar papers found.
M
Mihaela-Larisa Clement
Institute of Computer Engineering, TU Wien, Vienna, Austria; AIT Austrian Institute of Technology, Vienna, Austria
M
Mónika Farsang
Institute of Computer Engineering, TU Wien, Vienna, Austria
A
Agnes Poks
Institute of Mechanics and Mechatronics, TU Wien, Vienna, Austria
Johannes Edelmann
Johannes Edelmann
TU Wien
M
Manfred Plöchl
Institute of Mechanics and Mechatronics, TU Wien, Vienna, Austria
Radu Grosu
Radu Grosu
Professor of Computer Science
Hybrid SystemsCyber-Physical SystemsModelingAnalysis and ControlComputational Systems Biology
Ezio Bartocci
Ezio Bartocci
Faculty of Informatics, TU Wien
Cyber-Physical SystemsInternet of ThingsRuntime VerificationCyber-securitySafe AI