Tartanian7: A Champion Two-Player No-Limit Texas Hold'em Poker-Playing Program

Noam Brown,Sam Ganzfried,Tuomas Sandholm

doi:10.1609/aaai.v29i1.9285

Abstract

The leading approach for solving large imperfect-information games is automated abstraction followed by running an equilibrium-finding algorithm. We introduce a distributed version of the most commonly used equilibrium-finding algorithm, counterfactual regret minimization (CFR), which enables CFR to scale to dramatically larger abstractions and numbers of cores. The new algorithm begets constraints on the abstraction so as to make the pieces running on different computers disjoint. We introduce an algorithm for generating such abstractions while capitalizing on state-of-the-art abstraction ideas such as imperfect recall and the earth-mover's-distance similarity metric. Our techniques enabled an equilibrium computation of unprecedented size on a supercomputer with a high inter-blade memory latency. Prior approaches run slowly on this architecture. Our approach also leads to a significant improvement over using the prior best approach on a large shared-memory server with low memory latency. Finally, we introduce a family of post-processing techniques that outperform prior ones. We applied these techniques to generate an agent for two-player no-limit Texas Hold'em. It won the 2014 Annual Computer Poker Competition, beating each opponent with statistical significance.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tartanian7: A Champion Two-Player No-Limit Texas Hold'em Poker-Playing Program

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Mar 4, 2015
Citations: 4

Similar Papers

Hierarchical Abstraction, Distributed Equilibrium Computation, and Post-Processing, with Application to a Champion No-Limit Texas Hold'em Agent
...
-
, et. al. ...
04 May 2015
04 May 2015

Monte carlo sampling and regret minimization for equilibrium computation and decision-making in large extensive form games
...
-
, et. al. ...
01 Jan 2013
01 Jan 2013

Automated construction of bounded-loss imperfect-recall abstractions in extensive-form games
Jiří Čermák ... Branislav Bošanský
Artificial Intelligence | VOL. 282
Jiří Čermák, et. al.Jiří Čermák ... Branislav Bošanský
14 Feb 2020
Artificial Intelligence | VOL. 282

Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games
Sam Ganzfried ... Tuomas Sandholm
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence | VOL. 28
Sam Ganzfried, et. al.Sam Ganzfried ... Tuomas Sandholm
21 Jun 2014
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tartanian7: A Champion Two-Player No-Limit Texas Hold'em Poker-Playing Program

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence