End-to-End Learning Framework for Solving Non-Markovian Optimal Control

📅 2025-02-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Fractional-order linear time-invariant (FOLTI) systems lack standardized control paradigms, posing significant challenges for system identification and optimal control. Method: This paper proposes FOLOC—the first end-to-end data-driven optimal control framework for FOLTI systems—integrating fractional-order linear quadratic regulator (LQR) theory with deep neural networks. FOLOC directly learns optimal non-Markovian policies without assuming prior noise distributions (e.g., Gaussianity). Contribution/Results: It introduces a novel joint optimization paradigm unifying fractional-order system identification and control, and establishes the first sample-complexity bound for FOLOC, quantifying the data requirement to achieve near-optimal control in realistic settings. Experiments demonstrate that FOLOC achieves high-fidelity modeling of fractional-order dynamics across multiple benchmark tasks, significantly enhancing control robustness and cross-scenario generalization capability.

Technology Category

Application Category

📝 Abstract
Integer-order calculus often falls short in capturing the long-range dependencies and memory effects found in many real-world processes. Fractional calculus addresses these gaps via fractional-order integrals and derivatives, but fractional-order dynamical systems pose substantial challenges in system identification and optimal control due to the lack of standard control methodologies. In this paper, we theoretically derive the optimal control via extit{linear quadratic regulator} (LQR) for extit{fractional-order linear time-invariant }(FOLTI) systems and develop an end-to-end deep learning framework based on this theoretical foundation. Our approach establishes a rigorous mathematical model, derives analytical solutions, and incorporates deep learning to achieve data-driven optimal control of FOLTI systems. Our key contributions include: (i) proposing an innovative system identification method control strategy for FOLTI systems, (ii) developing the first end-to-end data-driven learning framework, extbf{F}ractional- extbf{O}rder extbf{L}earning for extbf{O}ptimal extbf{C}ontrol (FOLOC), that learns control policies from observed trajectories, and (iii) deriving a theoretical analysis of sample complexity to quantify the number of samples required for accurate optimal control in complex real-world problems. Experimental results indicate that our method accurately approximates fractional-order system behaviors without relying on Gaussian noise assumptions, pointing to promising avenues for advanced optimal control.
Problem

Research questions and friction points this paper is trying to address.

Non-Markovian optimal control challenges
Fractional-order dynamical system identification
End-to-end deep learning framework
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fractional-order linear time-invariant systems
End-to-end deep learning framework
Data-driven optimal control strategy
🔎 Similar Papers
2024-07-09Neural Information Processing SystemsCitations: 3
Xiaole Zhang
Xiaole Zhang
Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA 90089, USA
P
Peiyu Zhang
Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA 90089, USA
X
Xiongye Xiao
Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA 90089, USA
Shixuan Li
Shixuan Li
ECE PhD Student, University of Southern California
Machine LearningAI4Science
Vasileios Tzoumas
Vasileios Tzoumas
Assistant Professor, University of Michigan
Multi-agent systemsControlPerceptionRoboticsCombinatorial optimization
V
Vijay Gupta
Elmore Family School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA
Paul Bogdan
Paul Bogdan
Associate Professor, University of Southern California
Network PhysiologyCyber-Physical SystemsBrain & data scienceComputational ScienceNetworks