Abstract

In this article, the generalized N-step value gradient learning (GNSVGL) algorithm, which takes a long-term prediction parameter λ into account, is developed for infinite-horizon discounted near-optimal control of discrete-time nonlinear systems. The proposed GNSVGL algorithm accelerates the learning process of adaptive dynamic programming (ADP) and achieves better performance by learning from more than one future reward. Compared with the traditional N-step value gradient learning (NSVGL) algorithm with zero initial functions, the proposed GNSVGL algorithm is initialized with positive definite functions. Considering different initial cost functions, the convergence analysis of the value-iteration-based algorithm is provided. A stability condition for the iterative control policy is established to determine the value of the iteration index under which the control law renders the system asymptotically stable. Under this condition, if the system is asymptotically stable at the current iteration, then all subsequent iterative control laws are guaranteed to be stabilizing. Two critic neural networks and one action network are constructed to approximate the one-return costate function, the λ-return costate function, and the control law, respectively. Notably, the one-return and λ-return critic networks are combined to train the action neural network. Finally, simulation studies and comparisons confirm the superiority of the developed algorithm.
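To make the role of the long-term prediction parameter λ concrete, the following is a minimal, hypothetical sketch (not the paper's implementation) of how a λ-return blends 1-step through N-step discounted returns; the function names, the list-based reward/value representation, and the bootstrapped value estimates are all illustrative assumptions.

```python
# Hypothetical sketch of the lambda-return over N-step returns.
# All names and data structures here are assumptions for illustration.

def n_step_return(rewards, values, t, n, gamma):
    """n-step discounted return from time t, bootstrapping with values[t + n]."""
    G = 0.0
    for k in range(n):
        G += (gamma ** k) * rewards[t + k]
    G += (gamma ** n) * values[t + n]
    return G

def lambda_return(rewards, values, t, N, gamma, lam):
    """Blend the 1..N step returns: weights (1 - lam) * lam**(n - 1) for
    n < N, with the final N-step return taking the residual weight lam**(N - 1)."""
    G = 0.0
    for n in range(1, N):
        G += (1.0 - lam) * (lam ** (n - 1)) * n_step_return(rewards, values, t, n, gamma)
    G += (lam ** (N - 1)) * n_step_return(rewards, values, t, N, gamma)
    return G
```

With λ = 0 this recovers the one-step (one-return) target, and with λ = 1 it reduces to the full N-step return, matching the intuition that λ interpolates between short- and long-horizon prediction.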
