Student of Games: A unified learning algorithm for both perfect and imperfect information games.

Martin Schmid, Neil Burch, G Zacharias Holland, Elnaz Davoodi, Nolan Bard, Rudolf Kadlec, Matej Moravčík, Josh Davidson, Marc Lanctot, Michael Bowling, Kevin Waugh, Finbarr Timbers, Alden Christianson

doi:10.1126/sciadv.adg3256

Abstract

Games have a long history as benchmarks for progress in artificial intelligence. Approaches using search and learning produced strong performance across many perfect information games, and approaches using game-theoretic reasoning and learning demonstrated strong performance for specific imperfect information poker variants. We introduce Student of Games, a general-purpose algorithm that unifies previous approaches, combining guided search, self-play learning, and game-theoretic reasoning. Student of Games achieves strong empirical performance in large perfect and imperfect information games, an important step toward truly general algorithms for arbitrary environments. We prove that Student of Games is sound, converging to perfect play as available computation and approximation capacity increase. Student of Games reaches strong performance in chess and Go, beats the strongest openly available agent in heads-up no-limit Texas hold'em poker, and defeats the state-of-the-art agent in Scotland Yard, an imperfect information game that illustrates the value of guided search, learning, and game-theoretic reasoning.

Journal: Science Advances
Publication Date: Nov 17, 2023
Citations: 1
License type: CC BY-NC
