🤖 AI Summary
This paper addresses the optimal feedback control problem for stochastic control-affine systems with unknown nonlinear dynamics and stage-cost functions, assuming only the control penalty term and the constraints are known. We propose a fully data-driven framework that applies kernel mean embeddings (KMEs) to nonparametrically identify the Markov transition operators of controlled diffusion processes, and combines them with convex operator-theoretic reformulations of the Hamilton–Jacobi–Bellman (HJB) equation, thereby mitigating the curse of dimensionality inherent in classical dynamic programming. The approach operates entirely within reproducing kernel Hilbert spaces, relying on kernel methods and convex optimization rather than parametric model assumptions or trained function approximators. Evaluated on several high-dimensional nonlinear stochastic systems, the method demonstrates strong data efficiency and scalability, pointing toward real-time optimal control of black-box systems.
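The operator-theoretic HJB recursion mentioned above can be sketched in its generic finite-horizon dynamic-programming form (a standard template, not necessarily the paper's exact formulation; the symbols below are illustrative):

```latex
V_k(x) \;=\; \min_{u \in \mathcal{U}} \Big\{ \, c(x,u) \;+\; r(u) \;+\; \mathbb{E}\big[\, V_{k+1}(X_{k+1}) \,\big|\, X_k = x,\; U_k = u \,\big] \Big\},
```

where $c$ is the unknown stage cost (identified from data), $r$ is the known control penalty, $\mathcal{U}$ is the constraint set, and the conditional expectation is the action of the Markov transition operator on $V_{k+1}$, which is exactly the quantity the KME estimates nonparametrically.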
📝 Abstract
This paper proposes a fully data-driven approach for optimal control of nonlinear control-affine systems represented by a stochastic diffusion. The focus is on the scenario where both the nonlinear dynamics and the stage cost function are unknown, while only the control penalty function and the constraints are provided. Leveraging the theory of reproducing kernel Hilbert spaces, we introduce novel kernel mean embeddings (KMEs) to identify the Markov transition operators associated with controlled diffusion processes. The KME learning approach integrates seamlessly with modern convex operator-theoretic Hamilton–Jacobi–Bellman recursions. Thus, unlike traditional dynamic programming methods, our approach exploits the "kernel trick" to break the curse of dimensionality. We demonstrate the effectiveness of our method through numerical examples, highlighting its ability to solve a large class of nonlinear optimal control problems.
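To make the KME identification step concrete, here is a minimal sketch of the standard conditional mean embedding estimator, which the paper's approach builds on: given state-action samples paired with next states, it produces weights such that the conditional expectation of any function of the next state is approximated by a weighted sum over training samples. All function names and hyperparameters below are illustrative, not taken from the paper.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """Gaussian (RBF) kernel matrix between row-sample arrays A (n,d) and B (m,d)."""
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * sq)

def kme_transition_weights(Z_train, z_query, lam=1e-3, gamma=1.0):
    """Conditional mean embedding weights: E[g(X') | z] ~= alpha @ g(X'_train).

    Z_train: (n, d) stacked state-action samples observed during data collection.
    z_query: (m, d) state-action pairs at which to evaluate the transition operator.
    Returns alpha with shape (m, n), the standard kernel-ridge estimator
    alpha = k(z, Z) (K + n*lam*I)^{-1}.
    """
    n = Z_train.shape[0]
    K = rbf_kernel(Z_train, Z_train, gamma)           # Gram matrix on training inputs
    k = rbf_kernel(z_query, Z_train, gamma)           # cross-kernel to query points
    return k @ np.linalg.solve(K + n * lam * np.eye(n), np.eye(n))
```

In a dynamic-programming recursion, the expected cost-to-go at `(x, u)` would then be approximated as `alpha @ V(X'_train)`, turning the unknown transition operator into a finite linear map estimated purely from trajectory data.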