Abstract
This paper is concerned with a novel generalized policy iteration algorithm for solving optimal control problems for discrete-time nonlinear systems. The idea is to use an iterative adaptive dynamic programming algorithm to obtain iterative control laws that make the iterative value functions converge to the optimum. Initialized with an admissible control law, the iterative value functions are shown to be monotonically nonincreasing and to converge to the optimal solution of the Hamilton-Jacobi-Bellman equation, under the assumption that perfect function approximation is employed. The admissibility property is analyzed, showing that every iterative control law stabilizes the nonlinear system. Neural networks are utilized to implement the generalized policy iteration algorithm, approximating the iterative value function and computing the iterative control law, respectively, to achieve approximate optimal control. Finally, numerical examples are presented to verify the effectiveness of the proposed generalized policy iteration algorithm.
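To make the structure of the algorithm concrete, the following is a minimal sketch of generalized policy iteration for a discrete-time system: partial policy evaluation (applying the Bellman operator a finite number of times under the current control law) alternated with policy improvement. It uses a coarse tabular discretization in place of the paper's neural-network approximators; the dynamics F, the utility U, the grid sizes, and the evaluation horizon are all illustrative assumptions, not the paper's setup.

```python
# Minimal GPI sketch under illustrative assumptions: the state and
# control spaces are coarsely discretized so the value function can be
# stored in a table (the paper uses neural networks instead). The
# dynamics F, utility U, and all grid/iteration sizes are hypothetical.
import numpy as np

xs = np.linspace(-1.0, 1.0, 101)   # discretized state grid
us = np.linspace(-1.0, 1.0, 41)    # discretized control grid

def F(x, u):                       # example nonlinear dynamics x_{k+1} = F(x_k, u_k)
    return 0.8 * np.sin(x) + 0.5 * u

def U(x, u):                       # quadratic utility (stage cost)
    return x**2 + u**2

def nearest(x):                    # project a successor state onto the grid
    return int(np.argmin(np.abs(xs - x)))

V = np.zeros_like(xs)              # iterative value function
policy = np.zeros_like(xs)         # initial (admissible) control law: u = 0

for i in range(50):                # GPI outer loop
    # Partial policy evaluation: apply the Bellman operator under the
    # current control law a fixed number of times (1 sweep recovers
    # value iteration, infinitely many recover policy iteration; GPI
    # covers everything in between).
    for _ in range(5):
        V = np.array([U(x, policy[j]) + V[nearest(F(x, policy[j]))]
                      for j, x in enumerate(xs)])
    # Policy improvement: greedy control law w.r.t. the current value.
    for j, x in enumerate(xs):
        q = [U(x, u) + V[nearest(F(x, u))] for u in us]
        policy[j] = us[int(np.argmin(q))]

print("approximate optimal control at x=0.5:", policy[nearest(0.5)])
```

Starting from an admissible (stabilizing) control law, each improvement step keeps the tabulated value function nonincreasing, mirroring the monotonicity property established in the paper.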