AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms

Hang Xu,Kai Li,Junliang Xing,Haobo Fu,Qiang Fu

doi:10.1609/aaai.v36i5.20460

Abstract

Counterfactual regret minimization (CFR) is the most commonly used algorithm to approximately solving two-player zero-sum imperfect-information games (IIGs). In recent years, a series of novel CFR variants such as CFR+, Linear CFR, DCFR have been proposed and have significantly improved the convergence rate of the vanilla CFR. However, most of these new variants are hand-designed by researchers through trial and error based on different motivations, which generally requires a tremendous amount of efforts and insights. This work proposes to meta-learn novel CFR algorithms through evolution to ease the burden of manual algorithm design. We first design a search language that is rich enough to represent many existing hand-designed CFR variants. We then exploit a scalable regularized evolution algorithm with a bag of acceleration techniques to efficiently search over the combinatorial space of algorithms defined by this language. The learned novel CFR algorithm can generalize to new IIGs not seen during training and performs on par with or better than existing state-of-the-art CFR variants. The code is available at https://github.com/rpSebastian/AutoCFR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 2

Similar Papers

Automatically designing counterfactual regret minimization algorithms for solving imperfect-information games
Kai Li ... Junliang Xing
Artificial Intelligence | VOL. 337
Kai Li, et. al.Kai Li ... Junliang Xing
11 Oct 2024
Artificial Intelligence | VOL. 337

Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
Edward Lockhart ... Marc Lanctot
-
Edward Lockhart, et. al.Edward Lockhart ... Marc Lanctot
01 Aug 2019
01 Aug 2019

Scalable sub-game solving for imperfect-information games
Huale Li ... Shuhan Qi
Knowledge-Based Systems | VOL. 231
Huale Li, et. al.Huale Li ... Shuhan Qi
26 Aug 2021
Knowledge-Based Systems | VOL. 231

Value functions for depth-limited solving in zero-sum imperfect-information games
Vojtěch Kovařík ... Karel Ha
Artificial Intelligence | VOL. 314
Vojtěch Kovařík, et. al.Vojtěch Kovařík ... Karel Ha
19 Oct 2022
Artificial Intelligence | VOL. 314

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence