Abstract

This paper develops a novel off-policy game Q-learning algorithm to solve the anti-interference control problem for discrete-time linear multi-player systems using only measured data, without requiring the system matrices to be known. The primary contribution is that the Q-learning strategy in the proposed algorithm is implemented via off-policy policy iteration rather than on-policy learning, owing to the well-known advantages of off-policy Q-learning over its on-policy counterpart. All players cooperate to minimize their common performance index while counteracting the disturbance, which attempts to maximize that index; the players ultimately reach the Nash equilibrium of the game, at which the disturbance attenuation condition is satisfied. To find the Nash equilibrium solution, the anti-interference control problem is first transformed into an optimal control problem. An off-policy Q-learning algorithm is then proposed within the standard adaptive dynamic programming (ADP) and game-theoretic framework, so that the control policies of all players can be learned using only measured data. Comparative simulation results are provided to verify the effectiveness of the proposed method.
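To illustrate the kind of procedure the abstract describes, the following is a minimal, hypothetical Python sketch of off-policy Q-learning for a discrete-time linear system with one control player and one disturbance player (a simplified two-player zero-sum instance of the multi-player setting). The plant matrices A, B, D, the weights Qx, R, and the attenuation level gamma are illustrative assumptions used only to simulate data; the learning step itself uses the recorded samples alone, and the exact update equations of the paper's algorithm may differ.

```python
# Hypothetical sketch: off-policy game Q-learning via policy iteration.
# A single batch of data is collected once under an exploratory behavior
# policy and then reused for every policy-evaluation step (off-policy).
import numpy as np

rng = np.random.default_rng(0)

# Simulated plant (unknown to the learner): x_{k+1} = A x_k + B u_k + D w_k
A = np.array([[0.8, 0.2],
              [0.0, 0.7]])
B = np.array([[0.0],
              [1.0]])
D = np.array([[0.1],
              [0.0]])
n, m, q = 2, 1, 1

Qx = np.eye(n)      # state weight (assumed)
R = np.eye(m)       # control weight (assumed)
gamma = 5.0         # disturbance attenuation level (assumed large enough)

# --- Collect one batch of data with an exploratory behavior policy ---
N = 200
X, U, W, Xn = [], [], [], []
x = np.array([1.0, -1.0])
for _ in range(N):
    u = 0.5 * rng.standard_normal(m)      # exploratory control input
    w = 0.5 * rng.standard_normal(q)      # exploratory disturbance input
    xn = A @ x + B @ u + D @ w
    X.append(x); U.append(u); W.append(w); Xn.append(xn)
    x = xn
X, U, W, Xn = map(np.array, (X, U, W, Xn))

# --- Off-policy policy iteration on the quadratic Q-function z'Hz ---
K = np.zeros((m, n))    # control policy  u = K x
L = np.zeros((q, n))    # disturbance policy  w = L x
p = n + m + q

for it in range(20):
    # Policy evaluation: solve z_k'H z_k - z_{k+1}'H z_{k+1} = r_k in the
    # least-squares sense, where z_{k+1} uses the *target* policies (K, L)
    # although the data came from the behavior policy (off-policy step).
    Phi, Y = [], []
    for k in range(N):
        z = np.concatenate([X[k], U[k], W[k]])
        zn = np.concatenate([Xn[k], K @ Xn[k], L @ Xn[k]])
        Phi.append(np.kron(z, z) - np.kron(zn, zn))
        r = X[k] @ Qx @ X[k] + U[k] @ R @ U[k] - gamma**2 * W[k] @ W[k]
        Y.append(r)
    theta, *_ = np.linalg.lstsq(np.array(Phi), np.array(Y), rcond=None)
    H = theta.reshape(p, p)
    H = 0.5 * (H + H.T)          # only the symmetric part is identifiable

    # Policy improvement: stationarity of the Q-function in (u, w)
    Huw = H[n:, n:]                 # block over the stacked inputs [u; w]
    Hxs = H[n:, :n]                 # cross terms with the state
    G = -np.linalg.solve(Huw, Hxs)  # stacked gains [K_new; L_new]
    K_new, L_new = G[:m, :], G[m:, :]

    if np.linalg.norm(K_new - K) + np.linalg.norm(L_new - L) < 1e-8:
        K, L = K_new, L_new
        break
    K, L = K_new, L_new

print("learned control gain K:\n", K)
print("learned disturbance gain L:\n", L)
```

The key off-policy feature shown here is that the same recorded batch serves every iteration: only the target gains (K, L) inside the next-state feature change, so no new experiments are needed as the policies are refined.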
