Abstract

This article considers the $H_{\infty}$ control problem for nonlinear systems with unknown dynamics and asymmetric saturating actuators. First, the $H_{\infty}$ control problem is converted into a zero-sum game by introducing a nonquadratic cost function. Then, to solve the Hamilton–Jacobi–Isaacs (HJI) equation arising in this zero-sum game, a simultaneous policy iteration (SPI) algorithm is developed within the adaptive dynamic programming framework. It is proved that the convergence of the SPI algorithm is, in essence, equivalent to that of the sequential policy iteration algorithm. To implement the SPI algorithm, a critic, an actor, and a perturbation neural network (NN) are constructed to approximate the cost function, the control policy, and the perturbation, respectively. The weights of the three NNs are determined simultaneously by the least-squares method combined with the Monte Carlo integration technique. A remarkable characteristic of the SPI algorithm is that arbitrary control policies and perturbations can be applied during learning, so knowledge of the system dynamics can be replaced by data collected along the system's trajectories in advance; moreover, the persistence-of-excitation condition is not required. Finally, simulations of two nonlinear examples are given to validate the proposed SPI algorithm.
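
For context, the zero-sum-game formulation referenced above commonly takes the following form (generic notation from the $H_{\infty}$/ADP literature, not verbatim from the paper). For dynamics $\dot{x} = f(x) + g(x)u + k(x)w$ with control $u$ and perturbation $w$, the value function and the associated HJI equation read

$$ V^{*}(x_0) = \min_{u}\max_{w} \int_{0}^{\infty} \big( Q(x) + U(u) - \gamma^{2}\|w\|^{2} \big)\, dt, $$

$$ 0 = Q(x) + U(u^{*}) - \gamma^{2}\|w^{*}\|^{2} + (\nabla V^{*})^{\top} \big( f(x) + g(x)u^{*} + k(x)w^{*} \big), $$

where $\gamma > 0$ is the attenuation level and $U(u)$ is the nonquadratic penalty encoding the actuator bounds; for a symmetric bound $\lambda$, a common choice in the literature is $U(u) = 2\int_{0}^{u} \lambda \tanh^{-1}(v/\lambda) R\, dv$, which the asymmetric case adapts by shifting the saturation limits.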
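
To make the data-driven character of the SPI scheme concrete, below is a minimal sketch of an off-policy simultaneous least-squares update for a scalar toy system. Everything here is an illustrative assumption rather than the paper's implementation: the dynamics, the polynomial features standing in for the three NNs, a quadratic running cost standing in for the nonquadratic one, and Riemann sums over short trajectory segments standing in for the Monte Carlo integration. The data are generated once under arbitrary exploratory inputs, matching the abstract's point that learning is off-policy.

    import numpy as np

    rng = np.random.default_rng(0)

    # "Unknown" scalar dynamics dx/dt = a*x + b*u + c*w, used only to
    # generate data; the learner below never reads a, b, c directly.
    a, b, c = -1.0, 1.0, 0.5
    gamma, Q, R = 2.0, 1.0, 1.0   # attenuation level, (quadratic) cost weights
    dt, steps = 1e-3, 50          # Euler step; Bellman interval T = steps*dt

    phi = lambda x: np.array([x**2, x**4])  # critic features:  V(x) ~ Wc @ phi(x)
    psi = lambda x: np.array([x, x**3])     # actor features:   u(x) ~ Wa @ psi(x)
    sig = lambda x: np.array([x, x**3])     # perturbation:     w(x) ~ Ww @ sig(x)

    # Collect off-policy data once, under arbitrary exploratory inputs --
    # behavior and target policies are decoupled, so no persistence of
    # excitation along a specific policy is needed.
    tuples = []
    for _ in range(200):
        x = rng.uniform(-2.0, 2.0)
        xs, us, ws = [x], [], []
        for _ in range(steps):
            u = rng.uniform(-1.0, 1.0)        # arbitrary behavior control
            w = 0.2 * rng.standard_normal()   # arbitrary behavior perturbation
            x += dt * (a * x + b * u + c * w)
            xs.append(x); us.append(u); ws.append(w)
        tuples.append((np.array(xs), np.array(us), np.array(ws)))

    # Simultaneous policy iteration: each pass solves ONE least-squares
    # problem whose unknowns stack the critic weights together with the
    # improved actor and perturbation weights.
    Wa, Ww = np.zeros(2), np.zeros(2)
    for it in range(10):
        A, rhs = [], []
        for xs, us, ws in tuples:
            xm = xs[:-1]                         # states at left endpoints
            ui, wi = Wa @ psi(xm), Ww @ sig(xm)  # current iterates along data
            row_c = phi(xs[-1]) - phi(xs[0])
            row_a = 2 * R * dt * (psi(xm) * (us - ui)).sum(axis=1)
            row_w = -2 * gamma**2 * dt * (sig(xm) * (ws - wi)).sum(axis=1)
            A.append(np.concatenate([row_c, row_a, row_w]))
            rhs.append(dt * np.sum(-Q * xm**2 - R * ui**2 + gamma**2 * wi**2))
        theta, *_ = np.linalg.lstsq(np.array(A), np.array(rhs), rcond=None)
        Wc, Wa, Ww = theta[:2], theta[2:4], theta[4:6]

    print("critic:", Wc, " actor:", Wa, " perturbation:", Ww)

The joint solve is what "simultaneous" refers to: the critic update and both policy improvements come out of a single regression over the stored data, rather than alternating evaluation and improvement steps. This toy offers no convergence guarantee of its own; that analysis, with NN approximators and the nonquadratic cost, is the subject of the paper.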
