Abstract

StarCraft is a real-time strategy game that provides a complex environment for AI research. Macromanagement, i.e., selecting appropriate units to build depending on the current state, is one of the most important problems in this game. To reduce the need for expert knowledge and enhance coordination among the bot's modules, we adopt reinforcement learning (RL) to tackle the problem of macromanagement. We propose a novel deep RL method, Mean Asynchronous Advantage Actor-Critic (MA3C), which computes the approximate expected policy gradient instead of the gradient of a sampled action to reduce the variance of the gradient, and encodes the history queue with a recurrent neural network to handle imperfect information. The experimental results show that MA3C achieves a very high win rate, approximately 90%, against weaker opponents and improves the win rate by about 30% against stronger opponents. We also propose a novel method to visualize and interpret the policy learned by MA3C. Combining the visualized results with snapshots of games, we find that the learned macromanagement not only adapts to the game rules and the opponent bot's policy, but also cooperates well with the other modules of MA3C-Bot.
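The key idea in the abstract, computing the expected policy gradient over the whole action distribution rather than the gradient of one sampled action, can be illustrated with a minimal NumPy sketch. This is a toy illustration of the general variance-reduction principle, not the paper's actual implementation; the function names and the softmax policy parameterization are assumptions.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over action logits.
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()

def sampled_pg(logits, advantages, action):
    # Gradient estimate from ONE sampled action (REINFORCE/A3C style):
    # A(s, a) * d log pi(a|s) / d logits, where
    # d log pi(a|s) / d logits = one_hot(a) - pi(.|s).
    probs = softmax(logits)
    grad = -probs
    grad[action] += 1.0
    return advantages[action] * grad

def expected_pg(logits, advantages):
    # Exact expectation over all actions:
    # sum_a pi(a|s) * A(s, a) * d log pi(a|s) / d logits.
    # This removes the sampling variance of sampled_pg.
    probs = softmax(logits)
    grad = np.zeros_like(logits)
    for a, p in enumerate(probs):
        g = -probs
        g[a] += 1.0
        grad += p * advantages[a] * g
    return grad
```

By construction, `expected_pg` equals the probability-weighted average of `sampled_pg` over all actions, so it is an unbiased estimator with zero sampling variance; this is the sense in which averaging over the policy distribution stabilizes the gradient.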

Highlights

  • StarCraft is a Real-Time Strategy (RTS) game that was released by Blizzard Entertainment in 1998

  • The main contributions of this paper are: (i) we introduce the Reinforcement Learning (RL) method to solve the problem of macromanagement; (ii) we propose a novel deep RL method, Mean Asynchronous Advantage Actor-Critic (MA3C), which addresses the problems of imperfect information, uncertain state transitions, and long training time; and (iii) we present an approach to visualize the policy learned by the deep RL method

  • The main difference lies in the RL algorithms and networks: [16] applies Double Q-Learning in their network, while we propose a novel RL algorithm, MA3C, which runs multiple RL processes in parallel to reduce training time and computes the approximate expected gradient to improve the stability of the algorithm
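The parallel-worker idea mentioned in the highlights above can be sketched in a few lines: several workers compute local gradients and apply them asynchronously to shared parameters, in the spirit of A3C-style training. This is a toy sketch under simplifying assumptions (a single shared scalar parameter, a constant placeholder gradient, a lock instead of true lock-free updates), not the paper's actual training code.

```python
import threading

# Shared parameters updated asynchronously by all workers.
shared_params = [0.0]
lock = threading.Lock()

def worker(worker_id, steps):
    # Each worker would normally run its own environment rollouts and
    # compute a gradient from them; here a constant stands in for that.
    for _ in range(steps):
        grad = 0.01  # placeholder gradient from a local rollout
        with lock:
            shared_params[0] += grad

threads = [threading.Thread(target=worker, args=(i, 100)) for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Running several workers in parallel both shortens wall-clock training time and decorrelates the experience used for updates, which is the usual motivation for asynchronous actor-critic methods.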



Introduction

StarCraft is a Real-Time Strategy (RTS) game that was released by Blizzard Entertainment in 1998. Similar to other RTS games, the core tasks in StarCraft are gathering resources, training an army, and using it to defeat the opponent's army. The states of this game are only partially observable. Reinforcement learning (RL) is a powerful tool for solving sequential decision-making problems in which an agent interacts with the environment over a number of discrete time steps. At each discrete time step t, the agent receives a state s_t from the environment and responds with an action a_t selected from the action space A. The agent's goal is to maximize the expected discounted return E[R_t] by finding an optimal policy π(a|s_t).
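The discounted return R_t mentioned above is the standard quantity R_t = r_t + γ·r_{t+1} + γ²·r_{t+2} + …, which can be computed efficiently with one backward pass over a reward sequence. The following is a minimal sketch; the function name and the example rewards are illustrative, not from the paper.

```python
def discounted_return(rewards, gamma=0.99):
    # R_t = r_t + gamma * R_{t+1}, computed backward from the final step.
    R = 0.0
    returns = []
    for r in reversed(rewards):
        R = r + gamma * R
        returns.append(R)
    return list(reversed(returns))
```

For example, with rewards [1, 0, 1] and γ = 0.5, the returns are [1.25, 0.5, 1.0]: the last step contributes only its own reward, while earlier steps add geometrically discounted future rewards.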
