Continuous-Time Time-Varying Policy Iteration.

Qinglai Wei, Zhanyu Yang, Zehua Liao, Benkai Li, Derong Liu

doi:10.1109/tcyb.2019.2926631

Abstract

This paper presents a novel policy iteration algorithm, the continuous-time time-varying (CTTV) policy iteration algorithm, for obtaining optimal control laws for infinite-horizon CTTV nonlinear systems. The adaptive dynamic programming (ADP) technique is used to obtain the iterative control laws that optimize the performance index function. The monotonicity, convergence, and optimality of the iterative value function are analyzed, and the iterative value function is proven to converge monotonically to the optimal solution of the Hamilton-Jacobi-Bellman (HJB) equation. Furthermore, each iterative control law is guaranteed to be admissible, so that it stabilizes the nonlinear system. In the implementation of the CTTV policy iteration algorithm, neural networks approximate the iterative control laws and the iterative value function. Finally, numerical results verify the effectiveness of the presented method.
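The abstract does not give the algorithm's equations, so the following is only a minimal sketch of the general policy iteration pattern (evaluate the current control law, then improve it), not the paper's CTTV method. It assumes a much simpler setting: a scalar, linear, time-invariant system dx/dt = a*x + b*u with cost integrand q*x^2 + r*u^2 and linear control laws u = -k*x (Kleinman-style iteration). The function names and parameters are hypothetical, chosen for illustration. In this toy case the value coefficient p decreases monotonically toward the Riccati solution and every iterated gain remains stabilizing, which mirrors the monotonicity and admissibility properties the abstract describes.

```python
import math


def policy_iteration_scalar(a, b, q, r, k0, iterations=10):
    """Kleinman-style policy iteration for dx/dt = a*x + b*u, u = -k*x.

    Illustrative assumption: scalar linear time-invariant dynamics and a
    quadratic cost, NOT the time-varying nonlinear setting of the paper.
    Returns a list of (gain k_i, value coefficient p_i) pairs, where the
    value function of the control law u = -k_i*x is V_i(x) = p_i * x**2.
    """
    if a - b * k0 >= 0:
        raise ValueError("initial gain must be stabilizing (a - b*k0 < 0)")
    k = k0
    history = []
    for _ in range(iterations):
        # Policy evaluation: solve the scalar Lyapunov equation
        #   2*(a - b*k)*p + q + r*k**2 = 0   for p.
        p = (q + r * k ** 2) / (2.0 * (b * k - a))
        history.append((k, p))
        # Policy improvement: minimize the Hamiltonian over u, giving k = b*p/r.
        k = b * p / r
    return history


def riccati_solution(a, b, q, r):
    """Positive root of the scalar Riccati equation 2*a*p - (b*p)**2/r + q = 0."""
    return r * (a + math.sqrt(a ** 2 + b ** 2 * q / r)) / b ** 2


if __name__ == "__main__":
    for i, (k, p) in enumerate(policy_iteration_scalar(1.0, 1.0, 1.0, 1.0, k0=2.0)):
        print(f"iteration {i}: gain k = {k:.8f}, value coefficient p = {p:.8f}")
    print("Riccati solution:", riccati_solution(1.0, 1.0, 1.0, 1.0))
```

The initial gain must already stabilize the system, which is the scalar analogue of starting from an admissible control law. In the paper's setting the closed-form evaluation step would be replaced by neural network approximations of the value function and control law.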

Journal: IEEE Transactions on Cybernetics
Publication Date: Dec 1, 2020
Citations: 83

Similar Papers

Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems
Derong Liu ... Qinglai Wei
01 Jan 2017

Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems
Guangyu Zhu ... Peng Zhang
IEEE/CAA Journal of Automatica Sinica | VOL. 10
01 Mar 2023

Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming.
Qinglai Wei ... Qiao Lin
IEEE Transactions on Cybernetics | VOL. 47
18 Jul 2016

A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm
Qiao Lin ... Derong Liu
International Journal of Systems Science | VOL. 48
24 May 2016
