A TD3-based multi-agent deep reinforcement learning method in mixed cooperation-competition environment

Fengjiao Zhang,Jie Li,Zhi Li

doi:10.1016/j.neucom.2020.05.097

Abstract

We explored the problem about function approximation error and complex mission adaptability in multi-agent deep reinforcement learning. This paper proposes a new multi-agent deep reinforcement learning algorithm framework named multi-agent time delayed deep deterministic policy gradient. Our work reduces the overestimation error of neural network approximation and variance of estimation result using dual-centered critic, group target network smoothing and delayed policy updating. According to experiment results, it improves the ability to adapt complex missions eventually. Then, we discuss that there is an inevitable overestimation issue about existing multi-agent algorithms about approximating real action-value equations with neural network. We also explain the approximate error of equations in the multi-agent deep deterministic policy gradient algorithm mathematically and experimentally. Finally, the application of our algorithm in the mixed cooperative competition experimental environment further demonstrates the effectiveness and generalization of our algorithm, especially improving the group’s ability of adapting complex missions and completing more difficult missions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A TD3-based multi-agent deep reinforcement learning method in mixed cooperation-competition environment

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Jun 12, 2020
Citations: 53

Similar Papers

Multi-agent Deep Reinforcement Learning based on Maximum Entropy
Zihao Wang ... Yanxin Zhang
-
Zihao Wang, et. al.Zihao Wang ... Yanxin Zhang
18 Jun 2021
18 Jun 2021

Deep deterministic policy gradient algorithm for crowd-evacuation path planning
Xinjin Li ... Yan Li
Computers & Industrial Engineering | VOL. 161
Xinjin Li, et. al.Xinjin Li ... Yan Li
13 Aug 2021
Computers & Industrial Engineering | VOL. 161

Simulation of strategic bidding for battery storage and e-mobility in local flexibility markets with multi-agent reinforcement learning
J Tran ... L Gajewski
IET Conference Proceedings | VOL. 2022
J Tran, et. al.J Tran ... L Gajewski
15 Jun 2022
IET Conference Proceedings | VOL. 2022

The Study of Crash-Tolerant, Multi-Agent Offensive and Defensive Games Using Deep Reinforcement Learning
Xilun Li ... Xiaolong Zheng
Electronics | VOL. 12
Xilun Li, et. al.Xilun Li ... Xiaolong Zheng
08 Jan 2023
Electronics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A TD3-based multi-agent deep reinforcement learning method in mixed cooperation-competition environment

Abstract

Talk to us

Similar Papers

More From: Neurocomputing