🤖 AI Summary
For large-scale single-product air separation units (ASUs), demand response control faces challenges including limited observable variables, frequent constraint violations, and difficulty in guaranteeing economic performance. To address these, this paper proposes an end-to-end reinforcement learning–driven Koopman surrogate modeling framework for economic nonlinear model predictive control (eNMPC). It is the first work to integrate deep reinforcement learning with Koopman operator theory, enabling data-driven, low-dimensional, and interpretable dynamic surrogate modeling. Evaluated on a high-fidelity ASU simulation platform, the method significantly reduces constraint violations while achieving economic performance comparable to conventional system identification–based approaches. Moreover, it demonstrates superior robustness against measurement noise and model mismatch. The framework exhibits strong scalability and engineering applicability, making it suitable for real-world industrial deployment in complex process systems.
📝 Abstract
With our recently proposed method based on reinforcement learning (Mayfrank et al. (2024), Comput. Chem. Eng. 190), Koopman surrogate models can be trained for optimal performance in specific (economic) nonlinear model predictive control ((e)NMPC) applications. So far, our method has exclusively been demonstrated on a small-scale case study. Herein, we show that our method scales well to a more challenging demand response case study built on a large-scale model of a single-product (nitrogen) air separation unit. Across all numerical experiments, we assume observability of only a few realistically measurable plant variables. Compared to a purely system identification-based Koopman eNMPC, which generates small economic savings but frequently violates constraints, our method delivers similar economic performance while avoiding constraint violations.