Abstract
Multi-agent systems deliver highly resilient and adaptable solutions for common problems in telecommunications, aerospace, and industrial robotics. However, achieving an optimal global goal remains a persistent obstacle for collaborative multi-agent systems, where learning affects the behaviour of more than one agent. A number of nonlinear function approximation methods have been proposed for solving the Bellman equation, which describes a recursive form of the optimal policy. However, how to leverage the value distribution in reinforcement learning, and how to improve the efficiency and efficacy of such systems, remain open challenges. In this work, we develop a reward-reinforced generative adversarial network to represent the distribution of the value function, replacing the approximation of Bellman updates. We demonstrate that our method is resilient and outperforms conventional reinforcement learning methods. We also apply the method to a practical case study: maximising the number of user connections to autonomous airborne base stations in a mobile communication network. Our method maximises the data likelihood using a cost function under which agents learn optimal behaviours. This reward-reinforced generative adversarial network can be used as a generic framework for multi-agent learning at the system level.
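To make the core idea concrete, the sketch below shows one plausible way a GAN can represent a value distribution conditioned on state, with a reward-weighted term standing in for the reward reinforcement described in the abstract. This is a minimal, hypothetical illustration in PyTorch; the network sizes, the toy targets, and the weighting coefficient are assumptions and do not reproduce the paper's exact architecture or loss.

```python
# Minimal sketch: a GAN whose generator emits samples of a state-conditioned
# value distribution, trained against bootstrapped target values.
# All shapes, targets, and the reward-weighting term are illustrative assumptions.
import torch
import torch.nn as nn

STATE_DIM, NOISE_DIM, BATCH = 4, 8, 32

# Generator: maps (state, noise) -> one sample from the learned value distribution.
gen = nn.Sequential(nn.Linear(STATE_DIM + NOISE_DIM, 64), nn.ReLU(), nn.Linear(64, 1))
# Discriminator: scores (state, value) pairs as target vs. generated.
disc = nn.Sequential(nn.Linear(STATE_DIM + 1, 64), nn.ReLU(), nn.Linear(64, 1))

g_opt = torch.optim.Adam(gen.parameters(), lr=1e-3)
d_opt = torch.optim.Adam(disc.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

for step in range(200):
    # Toy batch: random states, rewards, and noisy bootstrapped targets stand in
    # for transitions collected from a multi-agent environment.
    state = torch.randn(BATCH, STATE_DIM)
    reward = torch.rand(BATCH, 1)
    target_value = reward + 0.99 * torch.randn(BATCH, 1)  # placeholder target

    # Discriminator update: separate target values from generated values.
    noise = torch.randn(BATCH, NOISE_DIM)
    fake_value = gen(torch.cat([state, noise], dim=1)).detach()
    d_loss = bce(disc(torch.cat([state, target_value], dim=1)), torch.ones(BATCH, 1)) \
           + bce(disc(torch.cat([state, fake_value], dim=1)), torch.zeros(BATCH, 1))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # Generator update: fool the discriminator; the reward-weighted term is a
    # hypothetical stand-in for reward reinforcement of the generated values.
    noise = torch.randn(BATCH, NOISE_DIM)
    fake_value = gen(torch.cat([state, noise], dim=1))
    g_loss = bce(disc(torch.cat([state, fake_value], dim=1)), torch.ones(BATCH, 1)) \
           - 0.1 * (reward * fake_value).mean()
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
```

In this reading, the generator plays the role of the value-distribution model that replaces explicit Bellman-update approximation, while the discriminator supplies the training signal by comparing generated values against target samples.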