The Study of Crash-Tolerant, Multi-Agent Offensive and Defensive Games Using Deep Reinforcement Learning

Xilun Li, Zhan Li, Xuebo Yang, Xinghu Yu, Xiaolong Zheng

doi:10.3390/electronics12020327

Abstract

In a multi-agent offensive and defensive game (ODG), each agent achieves its goal by cooperating or competing with other agents. Multi-agent deep reinforcement learning (MADRL) methods are applied in such scenarios to help agents make decisions. In many situations, agents on either side may crash due to collisions. However, existing algorithms cannot handle a decrease in the number of agents. Based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm, we study a method, called the frozen agent method for MADDPG (FA-MADDPG), that handles a reduction in the number of agents during training without changing the structure of the neural network (NN). In addition, we design a distance–collision reward function to help agents learn better strategies. Experiments in four scenarios with different numbers of agents verify that the proposed algorithm not only handles the reduction in the number of agents during training but also shows better performance and higher efficiency than the MADDPG algorithm in simulation.
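The abstract does not say how agents are frozen or how the distance–collision reward is computed, so the following Python sketch is only one plausible reading and not the authors' implementation. It assumes that a crashed agent keeps its slot in the fixed-size critic input with its observation and action zeroed out, and that the reward is the negative distance to a target minus a penalty on collision. The names `joint_critic_input`, `distance_collision_reward`, `collision_radius`, and `collision_penalty` are hypothetical.

```python
import numpy as np


def joint_critic_input(observations, actions, alive):
    """Build a fixed-size centralized-critic input (assumed freezing scheme).

    observations: (n_agents, obs_dim) array
    actions:      (n_agents, act_dim) array
    alive:        (n_agents,) boolean array, False for crashed agents

    Crashed agents keep their slots but contribute zeros, so the input
    length (and hence the network structure) never changes.
    """
    mask = alive[:, None].astype(observations.dtype)
    return np.concatenate([(observations * mask).ravel(),
                           (actions * mask).ravel()])


def distance_collision_reward(agent_pos, target_pos, other_pos,
                              collision_radius=0.1, collision_penalty=10.0):
    """Hypothetical reward: closer to the target is better, collisions cost.

    agent_pos, target_pos: (dim,) arrays
    other_pos:             (n_others, dim) array of other agents' positions
    """
    reward = -np.linalg.norm(agent_pos - target_pos)
    gaps = np.linalg.norm(other_pos - agent_pos, axis=1)
    if np.any(gaps < collision_radius):
        reward -= collision_penalty
    return float(reward)
```

In this sketch the critic input for three agents with 4-dimensional observations and 2-dimensional actions always has length 18, whether or not any agent has crashed, which is the property the abstract attributes to the method: the network structure stays the same as agents drop out.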


Journal: Electronics
Publication Date: Jan 8, 2023
Citations: 4
License type: CC BY 4.0


