Abstract

This paper presents a novel off-policy game Q-learning algorithm for solving the $H_\infty $ control problem for discrete-time linear multi-player systems with completely unknown system dynamics. The primary contribution of this paper lies in that the Q-learning strategy employed in the proposed algorithm is implemented in an off-policy policy iteration framework rather than through on-policy learning, since off-policy learning has some well-known advantages over on-policy learning. All players cooperate to minimize their common performance index while defeating the disturbance, which tries to maximize that index; they finally reach the Nash equilibrium of the game, at which the disturbance attenuation condition is satisfied. To find the Nash equilibrium solution, the $H_\infty $ control problem is first transformed into an optimal control problem. Then an off-policy Q-learning algorithm is put forward within the typical adaptive dynamic programming (ADP) and game-theoretic architecture, such that the control policies of all players can be learned using only measured data. More importantly, a rigorous proof that the proposed off-policy game Q-learning algorithm yields an unbiased solution to the Nash equilibrium is presented. Comparative simulation results are provided to verify the effectiveness and demonstrate the advantages of the proposed method.
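To make the off-policy idea concrete, the following is a minimal, illustrative sketch (not the paper's algorithm): a single-player, disturbance-free Q-learning policy iteration for a discrete-time linear system $x_{k+1}=Ax_k+Bu_k$ with quadratic cost. The matrices `A`, `B`, the cost weights, and the exploration noise level are assumptions chosen for the example. Data are generated by an exploratory behavior policy, while the target policy `K` is evaluated from that same data, which is the defining feature of off-policy learning that the paper extends to the multi-player $H_\infty $ game setting.

```python
import numpy as np

# Illustrative system (not from the paper): x_{k+1} = A x + B u,
# stage cost x'Qc x + u'Rc u.  The initial policy K = 0 is stabilizing here.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [0.1]])
Qc, Rc = np.eye(2), np.eye(1)
n, m = 2, 1
p = n + m
rng = np.random.default_rng(1)

def features(z):
    # Quadratic features so that z' H z = features(z) @ vech(H)
    # (off-diagonal entries are doubled to account for symmetry).
    w = np.outer(z, z)
    w = 2.0 * w - np.diag(np.diag(w))
    return w[np.triu_indices(p)]

def collect(K, steps=200):
    # Behavior policy = target policy + exploration noise: off-policy data.
    x = rng.standard_normal(n)
    data = []
    for _ in range(steps):
        u = -K @ x + 0.5 * rng.standard_normal(m)
        xn = A @ x + B @ u
        c = float(x @ Qc @ x + u @ Rc @ u)
        data.append((x, u, c, xn))
        x = xn
    return data

def evaluate(K, data):
    # Least-squares solve of z'Hz - z_next'H z_next = cost, where the
    # action at the NEXT state follows the TARGET policy u = -K x,
    # even though the logged action came from the behavior policy.
    Phi = np.array([features(np.concatenate([x, u]))
                    - features(np.concatenate([xn, -K @ xn]))
                    for x, u, _, xn in data])
    y = np.array([c for _, _, c, _ in data])
    h = np.linalg.lstsq(Phi, y, rcond=None)[0]
    H = np.zeros((p, p))
    H[np.triu_indices(p)] = h
    return (H + H.T) - np.diag(np.diag(H))

K = np.zeros((m, n))                          # initial stabilizing policy
for _ in range(3):                            # policy iteration
    H = evaluate(K, collect(K))
    K = np.linalg.solve(H[n:, n:], H[n:, :n]) # greedy improvement step
```

After a few iterations the learned gain `K` stabilizes the closed loop `A - B K` without ever using `A` or `B` in the learning equations, only the logged data; the paper's algorithm additionally handles multiple control players and a maximizing disturbance player.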

Highlights

  • The H∞ control method is a robust control approach aimed at designing controllers that attenuate the negative effects of external disturbances on the performance of dynamical systems while guaranteeing stability when no disturbance exists [1]–[3]

  • A review of the existing results on H∞ control for dynamical systems shows that most researchers focus on model-based H∞ controller design using a variety of methods, such as linear matrix inequalities (LMIs) [10]–[12], zero-sum games [13]–[17] and pole assignment [18]–[20]

  • In view of the advantages of off-policy learning over on-policy learning shown in our previous work [37], where an off-policy Q-learning method was proposed for multi-player systems without considering disturbances, our goal is to develop an off-policy game Q-learning algorithm that solves the H∞ control problem for discrete-time linear multi-player systems using only measured data


Summary

INTRODUCTION

The H∞ control method is a robust control approach aimed at designing controllers that attenuate the negative effects of external disturbances on the performance of dynamical systems while guaranteeing stability when no disturbance exists [1]–[3]. A review of the existing results on H∞ control for dynamical systems shows that most researchers focus on model-based H∞ controller design using a variety of methods, such as linear matrix inequalities (LMIs) [10]–[12], zero-sum games [13]–[17] and pole assignment [18]–[20]. In view of the advantages of off-policy learning over on-policy learning shown in our previous work [37], where an off-policy Q-learning method was proposed for multi-player systems without considering disturbances, our goal is to develop an off-policy game Q-learning algorithm that solves the H∞ control problem for discrete-time linear multi-player systems using only measured data.

Notation: the superscript T denotes the transpose, ⊗ stands for the Kronecker product, and vec(L) stacks the columns of any matrix L into a single column vector
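The ⊗ and vec(·) operators introduced above are typically combined through the identity vec(A X B) = (Bᵀ ⊗ A) vec(X), which turns matrix equations into linear equations in vec(X), the step that makes least-squares learning of kernel matrices possible. A small sketch verifying the identity numerically (the matrix sizes are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((2, 3))
X = rng.standard_normal((3, 4))
B = rng.standard_normal((4, 2))

# vec(.) stacks columns, i.e. column-major (Fortran-order) flattening.
lhs = (A @ X @ B).flatten(order="F")          # vec(A X B)
rhs = np.kron(B.T, A) @ X.flatten(order="F")  # (B^T kron A) vec(X)

assert np.allclose(lhs, rhs)
```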

PROBLEM STATEMENT
QUADRATIC FORM PROOF OF VALUE
ON-POLICY GAME Q-LEARNING ALGORITHM
1: Data collection
SIMULATION RESULTS
COMPARISON RESULTS OF ON-POLICY LEARNING WITH OFF-POLICY LEARNING
CONCLUSION
