Q-Learning for Continuous-Time Linear Systems: A Data-Driven Implementation of the Kleinman Algorithm

Corrado Possieri, Mario Sassano

doi:10.1109/tsmc.2022.3145693

Abstract

A data-driven strategy to estimate the optimal feedback and the value function in an infinite-horizon, continuous-time, linear-quadratic optimal control problem for an unknown system is proposed. The method permits the construction of the optimal policy without any knowledge of the model, without requiring that the time derivatives of the state be available for the design, and without even assuming that an initial stabilizing feedback policy is available. Two alternative architectures are discussed: the first scheme revolves around the periodic computation of some matrix inversions involving the Q-function, whereas the second approach relies on a purely continuous-time implementation of some dynamic systems whose trajectories are uniformly attracted by the solutions to the above algebraic equations. Interestingly, the proposed strategy essentially constitutes a (direct) data-driven implementation of the celebrated Kleinman algorithm, hence subsuming the particularly appealing features of the latter, such as quadratic monotone convergence to the optimal solution. The theory is then validated by means of practically motivated applications.
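For readers unfamiliar with the Kleinman algorithm mentioned in the abstract, the following is a minimal sketch of the classical, model-based iteration in the scalar case. It is not the data-driven scheme proposed in the paper: it assumes the model parameters `a` and `b` are known and that the initial gain `k0` is stabilizing, which are precisely the assumptions the paper's method is said to remove. The function names `kleinman_scalar` and `riccati_scalar` and the numerical values are illustrative choices, not taken from the paper.

```python
import math


def kleinman_scalar(a, b, q, r, k0, iterations=6):
    """Classical (model-based) Kleinman iteration for the scalar system
    dx/dt = a*x + b*u with cost integral of (q*x**2 + r*u**2) dt.

    Assumes q > 0, r > 0, b != 0 and a stabilizing initial gain k0.
    Returns the final gain and the list of value coefficients p_0, p_1, ...
    """
    if a - b * k0 >= 0:
        raise ValueError("k0 must be stabilizing, i.e. a - b*k0 < 0")
    k = k0
    history = []
    for _ in range(iterations):
        a_cl = a - b * k  # closed-loop dynamics under u = -k*x
        # Policy evaluation: scalar Lyapunov equation
        #   2*a_cl*p + q + r*k**2 = 0
        p = (q + r * k * k) / (-2.0 * a_cl)
        history.append(p)
        # Policy improvement: k = b*p/r
        k = b * p / r
    return k, history


def riccati_scalar(a, b, q, r):
    """Positive root of the scalar algebraic Riccati equation
    2*a*p - (b**2 / r)*p**2 + q = 0, used here only as a reference value."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


# Illustrative run on an open-loop unstable plant (a > 0).
gain, values = kleinman_scalar(a=1.0, b=1.0, q=1.0, r=1.0, k0=3.0)
reference = riccati_scalar(a=1.0, b=1.0, q=1.0, r=1.0)
for i, p in enumerate(values):
    print(f"iteration {i}: p = {p:.12f}, error = {abs(p - reference):.3e}")
```

Each pass solves a Lyapunov equation for the current gain and then updates the gain from the result. Printing the error against the Riccati solution lets one observe the monotone, rapidly shrinking sequence that the abstract refers to as quadratic monotone convergence.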

Journal: IEEE Transactions on Systems, Man, and Cybernetics: Systems
Publication Date: Oct 1, 2022
Citations: 10

Similar Papers

A revised Kleinman algorithm to solve algebraic Riccati equation of singularly perturbed systems
Hiroaki Mukaidani ... Koichi Mizukami
Automatica | VOL. 38
02 Jan 2002

Open-loop and closed-loop solvabilities for stochastic linear quadratic optimal control problems of Markovian regime switching system
Xin Zhang ... Xun Li
ESAIM: Control, Optimisation and Calculus of Variations | VOL. 27
01 Jan 2020

Linear‐Quadratic Optimal Control of NCSs with Random Input Gains
28 Apr 2023

Extended adaptive optimal control of linear systems with unknown dynamics using adaptive dynamic programming
Minggang Gan ... Chi Zhang
Asian Journal of Control | VOL. 23
08 Oct 2019
