Deep Deterministic Policy Gradients with Transfer Learning Framework in StarCraft Micromanagement

Dong Xie,Xiangnan Zhong

doi:10.1109/eit.2019.8833742

Abstract

This paper proposes an intelligent multi-agent approach in a real-time strategy game, StarCraft, based on the deep deterministic policy gradients (DDPG) techniques. An actor and a critic network are established to estimate the optimal control actions and corresponding value functions, respectively. A special reward function is designed based on the agents’ own condition and enemies’ information to help agents make intelligent control in the game. Furthermore, in order to accelerate the learning process, the transfer learning techniques are integrated into the training process. Specifically, the agents are trained initially in a simple task to learn the basic concept for the combat, such as detouring moving, avoiding and joining attacking. Then, we transfer this experience to the target task with a complex and difficult scenario. From the experiment, it is shown that our proposed algorithm with transfer learning can achieve better performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Deterministic Policy Gradients with Transfer Learning Framework in StarCraft Micromanagement

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning
Kun Shao ... Yuanheng Zhu
IEEE Transactions on Emerging Topics in Computational Intelligence | VOL. 3
Kun Shao, et. al.Kun Shao ... Yuanheng Zhu
01 Feb 2019
IEEE Transactions on Emerging Topics in Computational Intelligence | VOL. 3

EXPERIMENTS WITH ONLINE REINFORCEMENT LEARNING IN REAL-TIME STRATEGY GAMES
Kresten Toftgaard Andersen ... Dung Tran
Applied Artificial Intelligence | VOL. 23
Kresten Toftgaard Andersen, et. al.Kresten Toftgaard Andersen ... Dung Tran
22 Oct 2009
Applied Artificial Intelligence | VOL. 23

A Spectrum Handoff Method Based on Reinforcement and Transfer Learning
Jiaxing Zhao ... Fuchang Li
-
Jiaxing Zhao, et. al.Jiaxing Zhao ... Fuchang Li
01 Aug 2020
01 Aug 2020

Tank War Using Online Reinforcement Learning
Kresten Toftgaard Andersen ... Dung Tran
-
Kresten Toftgaard Andersen, et. al.Kresten Toftgaard Andersen ... Dung Tran
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Deterministic Policy Gradients with Transfer Learning Framework in StarCraft Micromanagement

Abstract

Talk to us

Similar Papers