Imperfect-Information Game AI Agent Based on Reinforcement Learning Using Tree Search and a Deep Neural Network

Xin Ouyang,Ting Zhou

doi:10.3390/electronics12112453

Abstract

In the field of computer intelligence, it has always been a challenge to construct an agent model that can be adapted to various complex tasks. In recent years, based on the planning algorithm of Monte Carlo tree search (MCTS), a new idea has been proposed to solve the AI problems of two-player zero-sum games such as chess and Go. However, most of the games in the real environment rely on imperfect information, so it is impossible to directly use the normal tree search planning algorithm to construct a decision-making model. Mahjong, which is a popular multiplayer game with a long history in China, attracts great attention from AI researchers because it contains a large game state space and a lot of hidden information. In this paper, we utilize an agent learning approach that leverages deep learning, reinforcement learning, and dropout learning techniques to implement a Mahjong AI game agent. First, we improve the state transition of the tree search based on the learned MDP model, the player position variable and transition information are introduced into the tree search algorithm to construct a multiplayer search tree. Then, the model training based on a deep reinforcement learning method ensures the stable and sustainable training process of the learned MDP model. Finally, we utilize the strategy data generated by the tree search and use the dropout learning method to train the normal decision-making agent. The experimental results demonstrate the efficiency and stability performance of the agent trained by our proposed method compared with existing agents in terms of test data accuracy, tournament ranking performance, and online match performance. The agent plays against human players and acts like real humans.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: May 29, 2023
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Imperfect-Information Game AI Agent Based on Reinforcement Learning Using Tree Search and a Deep Neural Network

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Study on deep reinforcement learning techniques for building energy consumption forecasting
Tao Liu ... Zhengfei Li
Energy and Buildings | VOL. 208
Tao Liu, et. al.Tao Liu ... Zhengfei Li
03 Dec 2019
Energy and Buildings | VOL. 208

Guest Editorial Special Issue on Deep/Reinforcement Learning and Games
I.-C Wu ... Y Tian
IEEE Transactions on Games | VOL. 10
I.-C Wu, et. al.I.-C Wu ... Y Tian
01 Dec 2018
IEEE Transactions on Games | VOL. 10

What can classic Atari video games tell us about the human brain?
Raphael Köster ... Martin J Chadwick
Neuron | VOL. 109
Raphael Köster, et. al.Raphael Köster ... Martin J Chadwick
01 Feb 2021
Neuron | VOL. 109

De Novo Drug Design Using Transformer-Based Machine Translation and Reinforcement Learning of an Adaptive Monte Carlo Tree Search.
Dony Ang ... Cyril Rakovski
Pharmaceuticals | VOL. 17
Dony Ang, et. al.Dony Ang ... Cyril Rakovski
27 Jan 2024
Pharmaceuticals | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Imperfect-Information Game AI Agent Based on Reinforcement Learning Using Tree Search and a Deep Neural Network

Abstract

Talk to us

Similar Papers

More From: Electronics