A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents

Zhen Zhang, Tingting Song, Dongqing Wang, Qiaoni Han, Dongbin Zhao

doi:10.1109/access.2018.2878853

Abstract

Multi-agent reinforcement learning (MARL) can be used to design intelligent agents that solve cooperative tasks. Within the MARL category, this paper proposes the probability of maximal reward based on infinitesimal gradient ascent (PMR-IGA) algorithm, which aims to reach the maximal total reward in repeated games. Theoretical analyses show that in a finite-player, finite-action repeated game with two pure optimal joint actions that share no common component action, both optimal joint actions are stable critical points of the PMR-IGA model. Furthermore, we use the Q-value function to estimate the gradient and derive the probability of maximal reward based on estimated gradient ascent (PMR-EGA) algorithm. Theoretical analyses and simulated case studies of repeated games show that the maximal total reward can be achieved under any initial conditions. The PMR-EGA can be naturally extended to optimize cooperative stochastic games. Two stochastic games, box pushing and a distributed sensor network, are used as test beds. The simulations show that the PMR-EGA consistently performs well on both stochastic games.
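The game class named in the abstract (two pure optimal joint actions with no common component action) can be made concrete with a small payoff table. The following is a minimal sketch, assuming a hypothetical two-agent, two-action cooperative game with made-up rewards. It does not implement PMR-IGA or PMR-EGA; it only checks the structural condition, and the function names are illustrative.

```python
def optimal_joint_actions(reward):
    """Return every joint action that attains the maximal shared reward."""
    best = max(reward.values())
    return [joint for joint, r in reward.items() if r == best]


def share_component_action(joint_a, joint_b):
    """True if some agent plays the same individual action in both joint actions."""
    return any(x == y for x, y in zip(joint_a, joint_b))


# Hypothetical shared reward table: keys are (action of agent 1, action of agent 2).
reward = {
    (0, 0): 10,
    (0, 1): 0,
    (1, 0): 0,
    (1, 1): 10,
}

optima = optimal_joint_actions(reward)
# The condition holds when neither agent repeats an action across the two optima.
no_common = not share_component_action(optima[0], optima[1])
print(optima, no_common)
```

In this assumed table the two optima are (0, 0) and (1, 1): each agent must pick a different individual action in each optimum, so agents that choose independently can miscoordinate and land on a zero-reward joint action.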

Journal: IEEE Access
Publication Date: Jan 1, 2018
Citations: 32
License type: CC BY-NC-ND

Similar Papers

A Cooperative Multi-Agent Reinforcement Learning Method Based on Coordination Degree
Haoyan Cui ... Zhen Zhang
IEEE Access | VOL. 9
01 Jan 2020

WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
Hui Liu ... Zhen Zhang
IEEE Access | VOL. 8
01 Jan 2020

Learning Automata-Based Multiagent Reinforcement Learning for Optimization of Cooperative Tasks
Zhen Zhang ... Junwei Gao
IEEE Transactions on Neural Networks and Learning Systems | VOL. 32
07 Oct 2020

Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
Chao Li ... Tangjie Lv
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024
