Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization

📅 2024-03-21
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF

career value

198K/year
🤖 AI Summary
Real-time economic nonlinear model predictive control (eNMPC) faces significant computational burdens from first-principles dynamic models, hindering simultaneous satisfaction of real-time implementation requirements and strict constraint enforcement. Method: This paper proposes a task-driven Koopman surrogate modeling framework that jointly embeds differentiable simulation and eNMPC optimization into surrogate training. Leveraging implicit gradient propagation, it enables end-to-end structured learning—bypassing conventional reinforcement learning paradigms while inherently guaranteeing hard constraint satisfaction. Contribution/Results: Theoretical integration combines Koopman operator theory with gradient-based policy optimization. Experimental evaluation on the CSTR benchmark demonstrates that the proposed method achieves comparable economic performance to state-of-the-art approaches while eliminating all constraint violations, thereby substantially improving controller reliability and real-time feasibility.

Technology Category

Application Category

📝 Abstract
Mechanistic dynamic process models may be too computationally expensive to be usable as part of a real-time capable predictive controller. We present a method for end-to-end learning of Koopman surrogate models for optimal performance in a specific control task. In contrast to previous contributions that employ standard reinforcement learning (RL) algorithms, we use a training algorithm that exploits the differentiability of environments based on mechanistic simulation models to aid the policy optimization. We evaluate the performance of our method by comparing it to that of other training algorithms on an existing economic nonlinear model predictive control (eNMPC) case study of a continuous stirred-tank reactor (CSTR) model. Compared to the benchmark methods, our method produces similar economic performance while eliminating constraint violations. Thus, for this case study, our method outperforms the others and offers a promising path toward more performant controllers that employ dynamic surrogate models.
Problem

Research questions and friction points this paper is trying to address.

Develop task-optimal surrogate models for eNMPC
Use differentiable simulation for policy optimization
Achieve economic performance without constraint violations
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end learning of Koopman surrogate models
Differentiable simulation for policy optimization
Constraint-free economic performance in eNMPC
💼 Related Jobs
Postdoctoral Fellow – AI/ML Enabled Bioprocess Modeling and Control
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Andover