Abstract

Multi-agent deep reinforcement learning (MDRL) is an emerging research hotspot and application direction in machine learning and artificial intelligence. MDRL encompasses many algorithms, rules, and frameworks, and is currently being applied to swarm systems, energy allocation optimization, stock analysis, and sequential social dilemmas, with an extremely promising future. In this paper, a parallel-critic method based on the classic MDRL algorithm MADDPG is proposed to alleviate the training-instability problem in cooperative-competitive multi-agent environments. Furthermore, a policy smoothing technique is introduced into the proposed method to decrease the variance of the learned policies. The proposed method is evaluated in three scenarios of the authoritative multi-agent particle environment (MPE). Statistical analysis of the experimental results shows that our method significantly improves training stability and performance compared with vanilla MADDPG.
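The policy smoothing mentioned above can be illustrated with a TD3-style perturbation of the target action with clipped noise, so that the critic's value estimate is effectively averaged over nearby actions. This is a plausible sketch only; the function name, noise parameters, and action bounds are assumptions, not necessarily the paper's exact technique:

```python
import random

def smoothed_target_action(policy_action, noise_std=0.2, noise_clip=0.5,
                           low=-1.0, high=1.0, rng=random):
    """Perturb a target-policy action with clipped Gaussian noise
    (TD3-style target policy smoothing; parameters are illustrative)."""
    noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, noise_std)))
    # Keep the smoothed action inside the valid action range.
    return max(low, min(high, policy_action + noise))
```

Because the noise is clipped, the smoothed action never strays more than `noise_clip` from the original action, which bounds how much the value target can change.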

Highlights

  • Reinforcement learning (RL) [1] is an important branch of machine learning

  • Figure 5 shows the mean-episode-reward curves of the original Multi-agent Deep Deterministic Policy Gradient (MADDPG) and MADDPG-PC (U = 2, 3, 4) in three scenarios over 60,000 training episodes; the curves stabilize after roughly 30,000 episodes in each scenario, so our calculations are based on test data from after episode 30,000

  • We can conclude that MADDPG-PC stabilizes the training process to a significant degree by training parallel critics simultaneously; most of the time, stability improves as the number of critics increases, presumably because more critics supply more stable policies
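The parallel-critic idea behind the highlight above can be sketched as maintaining U independent value estimates and combining them (here by averaging) into one lower-variance TD target. The class name, the averaging rule, and the scalar toy state are illustrative assumptions, not the paper's exact update:

```python
class ParallelCritics:
    """Toy sketch of U parallel critics for a single state-action pair.
    Each critic holds its own estimate; the TD target is built from the
    critics' averaged next-state value to damp individual noise."""

    def __init__(self, num_critics, lr=0.1, gamma=0.95):
        self.q = [0.0] * num_critics  # one scalar estimate per critic
        self.lr = lr
        self.gamma = gamma

    def update(self, reward):
        # Average across parallel critics, then move every critic
        # toward the shared, lower-variance target.
        next_value = sum(self.q) / len(self.q)
        target = reward + self.gamma * next_value
        for i in range(len(self.q)):
            self.q[i] += self.lr * (target - self.q[i])
        return target
```

Averaging over several critics reduces the variance of the target relative to any single critic's estimate, which is one intuition for why more critics can yield more stable training.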


Summary

Introduction

Reinforcement learning (RL) [1] is an important branch of machine learning. The essence of RL is that agents learn policies through interaction with the environment in order to maximize returns or achieve specific goals. Unlike supervised learning, which tells the agent directly which actions are correct, RL evaluates and corrects action selection based on the feedback signal from the environment. RL is well suited to complicated decision-making problems because reward functions are easier to design and less information is required. Deep reinforcement learning (DRL) [2], which combines deep neural networks (DNN) with traditional RL methods, has become a research hotspot and made tremendous breakthroughs in computer vision, robot control, large-scale real-time strategy games, etc.
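The interaction loop described above (act, observe feedback, correct the action-value estimates) can be sketched with tabular Q-learning on a toy chain environment; the environment, hyperparameters, and function name are illustrative assumptions, not anything from the paper:

```python
import random

def q_learning_chain(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy chain: the agent moves left/right on
    states 0..n_states-1 and earns reward 1 for reaching the right end.
    Estimates are corrected from environment feedback, not from labels."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action], 0=left, 1=right
    for _ in range(episodes):
        s = 0
        while s < n_states - 1:
            # Epsilon-greedy action selection: mostly exploit, sometimes explore.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            s2 = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s2 == n_states - 1 else 0.0
            # TD update: move q[s][a] toward the bootstrapped target.
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
    return q
```

After training, the learned values prefer moving right (toward the reward) in every state, even though the agent was never told which action is correct.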


