Stability and Admissibility Analysis for Zero-Sum Games Under General Value Iteration Formulation.

Ding Wang,Junfei Qiao,Mingming Ha,Mingming Zhao

doi:10.1109/tnnls.2022.3152268

Abstract

In this article, the general value iteration (GVI) algorithm for discrete-time zero-sum games is investigated. The theoretical analysis focuses on stability properties of the systems and also the admissibility properties of the iterative policy pair. A new criterion is established to determine the admissibility of the current policy pair. Besides, based on the admissibility criterion, the improved GVI algorithm toward zero-sum games is developed to guarantee that all iterative policy pairs are admissible if the current policy pair satisfies the criterion. On the basis of the attraction domain, we demonstrate that the state trajectory will stay in the region using the fixed or the evolving policy pair if the initial state belongs to the domain. It is emphasized that the evolving policy pair can stabilize the controlled system. These theoretical results are applied to linear and nonlinear systems via offline and online critic control design.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stability and Admissibility Analysis for Zero-Sum Games Under General Value Iteration Formulation.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Nov 1, 2023
Citations: 18

Similar Papers

Finite Approximation Error-Based Value Iteration ADP
Derong Liu ... Qinglai Wei
-
Derong Liu, et. al.Derong Liu ... Qinglai Wei
01 Jan 2017
01 Jan 2017

Adaptive dynamic programming-based optimal tracking control for nonlinear systems using general value iteration
Xiaofeng Lin ... Chunning Song
-
Xiaofeng Lin, et. al.Xiaofeng Lin ... Chunning Song
01 Dec 2014
01 Dec 2014

Advanced Optimal Tracking Control With Stability Guarantee via Novel Value Learning Formulation.
Ding Wang ... Menghua Li
IEEE transactions on neural networks and learning systems | VOL. 35
Ding Wang, et. al.Ding Wang ... Menghua Li
01 Jun 2024
IEEE transactions on neural networks and learning systems | VOL. 35

Learning Algorithms for Differential Games of Continuous-Time Systems
Derong Liu ... Xiong Yang
-
Derong Liu, et. al.Derong Liu ... Xiong Yang
01 Jan 2017
01 Jan 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stability and Admissibility Analysis for Zero-Sum Games Under General Value Iteration Formulation.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems