World-class interpretable poker

Dimitris Bertsimas,Alex Paskov

doi:10.1007/s10994-022-06179-8

Abstract

We address the problem of interpretability in iterative game solving for imperfect-information games such as poker. This lack of interpretability has two main sources: first, the use of an uninterpretable feature representation, and second, the use of black box methods such as neural networks, for the fitting procedure. In this paper, we present advances on both fronts. Namely, first we propose a novel, compact, and easy-to-understand game-state feature representation for Heads-up No-limit (HUNL) Poker. Second, we make use of globally optimal decision trees, paired with a counterfactual regret minimization (CFR) self-play algorithm, to train our poker bot which produces an entirely interpretable agent. Through experiments against Slumbot, the winner of the most recent Annual Computer Poker Competition, we demonstrate that our approach yields a HUNL Poker agent that is capable of beating the Slumbot. Most exciting of all, the resulting poker bot is highly interpretable, allowing humans to learn from the novel strategies it discovers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning	Publication Date: Jun 9, 2022
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

World-class interpretable poker

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Similar Papers

Using counterfactual regret minimization to create competitive multiplayer poker agents
...
-
, et. al. ...
10 May 2010
10 May 2010

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Hang Xu ... Jian Cheng
-
Hang Xu, et. al.Hang Xu ... Jian Cheng
01 Aug 2024
01 Aug 2024

Scalable sub-game solving for imperfect-information games
Huale Li ... Shuhan Qi
Knowledge-Based Systems | VOL. 231
Huale Li, et. al.Huale Li ... Shuhan Qi
26 Aug 2021
Knowledge-Based Systems | VOL. 231

Rethinking Formal Models of Partially Observable Multiagent Decision Making (Extended Abstract)
Vojtěch Kovařík ... Martin Schmid
-
Vojtěch Kovařík, et. al.Vojtěch Kovařík ... Martin Schmid
01 Aug 2023
01 Aug 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

World-class interpretable poker

Abstract

Talk to us

Similar Papers

More From: Machine Learning