A game strategy model in the digital curling system based on NFSP

Yuntao Han, Qibin Zhou, Fuqing Duan

doi:10.1007/s40747-021-00345-6


Open Access

https://doi.org/10.1007/s40747-021-00345-6


Journal: Complex & Intelligent Systems	Publication Date: Mar 31, 2021
Citations: 7	License type: open-access

Affiliation: Beijing Normal University

Abstract

The digital curling game is a two-player zero-sum extensive game in a continuous action space. Several challenging problems are still not well solved, including the uncertainty of strategy, the search of a large game tree, and the dependence on large amounts of supervised data. In this work, we combine NFSP and KR-UCT for digital curling games: NFSP uses two adversarial learning networks and automatically produces its own supervised data, while KR-UCT handles large game tree search in a continuous action space. We propose two reward mechanisms to make reinforcement learning converge quickly. Experimental results validate the proposed method and show that the strategy model can reach the Nash equilibrium.

Highlights

Machine games and artificial intelligence have long been closely related, and machine games are an important form of artificial intelligence.
From the game theory of von Neumann [1], the father of computers, to the well-known AlphaGo [2], machine games have always been in the public eye.
We combine neural fictitious self-play (NFSP) and kernel regression UCT (KR-UCT, where UCT stands for Upper Confidence Bounds Applied to Trees) for digital curling games, where NFSP avoids manual labeling of supervised data and KR-UCT handles large game tree search in a continuous action space.
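
To illustrate how KR-UCT can share information between nearby actions in a continuous action space, the following is a minimal sketch of its kernel regression selection step only. It is not the authors' implementation: the Gaussian kernel, the `bandwidth` and `c` parameters, and the function names are assumptions made for illustration, and progressive widening and execution noise are omitted.

```python
import math

def gaussian_kernel(a, b, bandwidth=1.0):
    """Similarity between two actions (tuples of floats). Kernel choice is an assumption."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2.0 * bandwidth ** 2))

def kr_estimates(actions, visits, mean_values, bandwidth=1.0):
    """Return (value, density) for each action via kernel regression over its siblings.

    value   = sum_b K(a, b) * n_b * v_b / sum_b K(a, b) * n_b
    density = sum_b K(a, b) * n_b
    """
    estimates = []
    for a in actions:
        weights = [gaussian_kernel(a, b, bandwidth) * n
                   for b, n in zip(actions, visits)]
        density = sum(weights)
        if density > 0:
            value = sum(w * v for w, v in zip(weights, mean_values)) / density
        else:
            value = 0.0
        estimates.append((value, density))
    return estimates

def select_action(actions, visits, mean_values, c=1.0, bandwidth=1.0):
    """Pick the index maximizing the smoothed value plus a density-based exploration bonus."""
    estimates = kr_estimates(actions, visits, mean_values, bandwidth)
    total = sum(density for _, density in estimates)
    log_total = math.log(max(total, 1.0))  # guard against log of values below 1

    def score(i):
        value, density = estimates[i]
        if density == 0:
            return math.inf  # an action with no nearby samples is tried first
        return value + c * math.sqrt(log_total / density)

    return max(range(len(actions)), key=score)
```

With equal value estimates, the exploration bonus favors an action that lies far from the heavily sampled region, which is what lets the search cover a continuous space without visiting every action.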

Summary

Introduction

Machine games and artificial intelligence have been closely related, and machine games are an important form of artificial intelligence. The digital curling game has many action strategies, a large search space, and strong uncertainty, and it is a typical extensive-form game [3]. For extensive-form games, several challenging problems are still not well solved, including the uncertainty of strategy, the search of a large game tree, and the dependence on large amounts of supervised data. Some researchers have used Monte Carlo search and KR-UCT algorithms to improve game performance. These solutions still need a lot of supervised data and prior knowledge. We combine NFSP and KR-UCT for digital curling games, where NFSP avoids manual labeling of supervised data and KR-UCT handles large game tree search in a continuous action space.
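
The way NFSP produces its own supervised data can be sketched as follows. This follows the general NFSP formulation rather than the paper's specific networks: the two policies are passed in as plain callables standing in for the best-response and average-policy networks, and the class names, `eta`, `capacity`, and the list-based reinforcement learning memory are all assumptions made for illustration.

```python
import random

class ReservoirBuffer:
    """Keeps a uniform random sample of everything added (reservoir sampling)."""

    def __init__(self, capacity, rng):
        self.capacity = capacity
        self.rng = rng
        self.items = []
        self.seen = 0

    def add(self, item):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(item)
        else:
            # Replace a stored item with probability capacity / seen.
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = item

class NFSPAgentSketch:
    """Hypothetical agent mixing a best-response policy with an average policy."""

    def __init__(self, best_response, average_policy, eta=0.1,
                 capacity=1000, seed=0):
        self.best_response = best_response    # stands in for the RL network
        self.average_policy = average_policy  # stands in for the supervised network
        self.eta = eta                        # anticipatory parameter
        self.rng = random.Random(seed)
        self.sl_memory = ReservoirBuffer(capacity, self.rng)
        self.rl_memory = []                   # simplified; usually a bounded buffer

    def act(self, state):
        if self.rng.random() < self.eta:
            action = self.best_response(state)
            # The agent's own best-response behavior becomes the labeled data
            # for the average policy, so no manual labeling is needed.
            self.sl_memory.add((state, action))
        else:
            action = self.average_policy(state)
        return action

    def observe(self, state, action, reward, next_state):
        # Every transition feeds the reinforcement learning side.
        self.rl_memory.append((state, action, reward, next_state))
```

In a full system, the best-response network would be trained from `rl_memory` and the average-policy network from `sl_memory`; both training loops are left out of this sketch.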

Related works

Methods

Experiments

Results and discussion

Conclusion


Similar Papers

Energy management of hybrid electric bus based on deep reinforcement learning in continuous state and action space
Huachun Tan ... Yuankai Wu
Energy Conversion and Management | VOL. 195
18 May 2019

Monte Carlo Tree Search in Continuous Spaces Using Voronoi Optimistic Optimization with Regret Bounds
Beomjoon Kim ... Sungbin Lim
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020

Action decoupled SAC reinforcement learning with discrete-continuous hybrid action spaces
Yahao Xu ... Hongbin Deng
Neurocomputing | VOL. 537
31 Mar 2023

Stochastic fictitious play with continuous action sets
S Perkins ... D.S Leslie
Journal of Economic Theory | VOL. 152
25 Apr 2014
