Efficient off‐policy Q‐learning for multi‐agent systems by solving dual games

Yan Wang, Jiwei Wen, Huiwen Xue, Jinfeng Liu, Xiaoli Luan

doi:10.1002/rnc.7189

Abstract

This article develops distributed optimal control policies via Q‐learning for multi‐agent systems (MASs) by solving dual games. First, following game theory, the distributed consensus problem is formulated as a multi‐player non‐zero‐sum game in which each agent is a player that focuses only on its local performance, and the MAS as a whole reaches a Nash equilibrium. Second, for each agent, the anti‐disturbance problem is formulated as a two‐player zero‐sum game in which the control input and the external disturbance are opponents. Specifically, (1) an offline, data‐driven, off‐policy distributed tracking algorithm based on momentum policy gradient (MPG) is developed, which can effectively achieve consensus of MASs with guaranteed ‐bounded synchronization error. (2) An actor‐critic‐disturbance neural network is employed to implement the MPG algorithm and obtain the optimal policies. Finally, numerical and practical simulations are conducted to verify the effectiveness of the tracking policies developed via the MPG algorithm.

More From: International Journal of Robust and Nonlinear Control

Similar Papers

Differential graphical games of multiagent systems with nonzero leader's control input and external disturbances
Guan Huang ... Xinxin Guo
International Journal of Robust and Nonlinear Control | VOL. 34
18 Apr 2024

Learning‐based robust control methodologies under information constraints
Hamid Reza Karimi ... Ning Wang
International Journal of Robust and Nonlinear Control | VOL. 32
26 Jan 2022

Fixed-time Leader-follower Consensus of Multi-agent Systems With Nonzero Leader's Control Input
Donglin Wang ... Xiangyong Chen
08 May 2023

Distributed Adaptive-Neural Finite-Time Consensus Control for Stochastic Nonlinear Multiagent Systems Subject to Saturated Inputs.
Fatemeh Sedghi ... Shen Yin
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP
01 Oct 2023
