On the System Theoretic Offline Learning of Continuous-Time LQR with Exogenous Disturbances

📅 2025-09-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the offline learning problem for continuous-time linear quadratic regulators (LQR) under uncertain external disturbances, distinguishing between exogenous disturbances that are measurable (enabling deterministic estimation) and those that are unmeasurable (stochastic and unknown). We propose a unified offline learning framework integrating adaptive dynamic programming, Lyapunov stability analysis, and sample-based Markov decision process (MDP) approximation. Without requiring online system interaction, the framework guarantees global asymptotic stability and convergence of the learned control gain. Our key contribution is the first rigorously justified offline learning mechanism for continuous-time LQR under non-observable stochastic disturbances—achieving both theoretical soundness grounded in control theory and algorithmic simplicity. Numerical experiments demonstrate the method’s robustness and learning efficiency across diverse disturbance classes.

Technology Category

Application Category

📝 Abstract
We analyze offline designs of linear quadratic regulator (LQR) strategies with uncertain disturbances. First, we consider the scenario where the exogenous variable can be estimated in a controlled environment, and subsequently, consider a more practical and challenging scenario where it is unknown in a stochastic setting. Our approach builds on the fundamental learning-based framework of adaptive dynamic programming (ADP), combined with a Lyapunov-based analytical methodology to design the algorithms and derive sample-based approximations motivated from the Markov decision process (MDP)-based approaches. For the scenario involving non-measurable disturbances, we further establish stability and convergence guarantees for the learned control gains under sample-based approximations. The overall methodology emphasizes simplicity while providing rigorous guarantees. Finally, numerical experiments focus on the intricacies and validations for the design of offline continuous-time LQR with exogenous disturbances.
Problem

Research questions and friction points this paper is trying to address.

Designing offline LQR strategies with uncertain exogenous disturbances
Handling both measurable and unknown disturbances in stochastic settings
Providing stability guarantees for learned control gains under approximations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines adaptive dynamic programming with Lyapunov analysis
Develops sample-based approximations from MDP approaches
Provides stability guarantees for non-measurable disturbances
🔎 Similar Papers
No similar papers found.