Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis

Qinglai Wei,Ruizhuo Song,Frank L Lewis,Derong Liu,Hanquan Lin

doi:10.1109/tsmc.2016.2623766

Abstract

In this paper, convergence properties are established for the newly developed discrete-time local value iteration adaptive dynamic programming (ADP) algorithm. The present local iterative ADP algorithm permits an arbitrary positive semidefinite function to initialize the algorithm. Employing a state-dependent learning rate function, for the first time, the iterative value function and iterative control law can be updated in a subset of the state space instead of the whole state space, which effectively relaxes the computational burden. A new analysis method for the convergence property is developed to prove that the iterative value functions will converge to the optimum under some mild constraints. Monotonicity of the local value iteration ADP algorithm is presented, which shows that under some special conditions of the initial value function and the learning rate function, the iterative value function can monotonically converge to the optimum. Finally, three simulation examples and comparisons are given to illustrate the performance of the developed algorithm.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society

Lead the way for us

Journal: IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society	Publication Date: Jun 1, 2018
Citations: 153

Similar Papers

Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Admissibility and Termination Analysis.
Qinglai Wei ... Qiao Lin
IEEE transactions on neural networks | VOL. 28
Qinglai Wei, et. al.Qinglai Wei ... Qiao Lin
01 Nov 2017
IEEE transactions on neural networks | VOL. 28

Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
Qinglai Wei ... Derong Liu
IEEE transactions on cybernetics | VOL. 46
Qinglai Wei, et. al.Qinglai Wei ... Derong Liu
02 Nov 2015
IEEE transactions on cybernetics | VOL. 46

Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming.
Qinglai Wei ... Ruizhuo Song
IEEE transactions on cybernetics | VOL. 47
Qinglai Wei, et. al.Qinglai Wei ... Ruizhuo Song
18 Jul 2016
IEEE transactions on cybernetics | VOL. 47

Neuro-Optimal Control for Discrete Stochastic Processes via a Novel Policy Iteration Algorithm
Mingming Liang ... Ding Wang
IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society | VOL. 50
Mingming Liang, et. al.Mingming Liang ... Ding Wang
29 May 2019
29 May 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society