Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

Nicola Gatti ,Alberto Marchesi ,Francesco Trovò

doi:10.48448/6ap8-1p72

Abstract

We tackle the problem of learning equilibria in simulation-based games. In such games, the players' utility functions cannot be described analytically, as they are given through a black-box simulator that can be queried to obtain noisy estimates of the utilities. This is the case in many real-world games in which a complete description of the elements involved is not available upfront, such as complex military settings and online auctions. In these situations, one usually needs to run costly simulation processes to get an accurate estimate of the game outcome. As a result, solving these games begets the challenge of designing learning algorithms that can find (approximate) equilibria with high confidence, using as few simulator queries as possible. Moreover, since running the simulator during the game is unfeasible, the algorithms must first perform a pure exploration learning phase and, then, use the (approximate) equilibrium learned this way to play the game. In this work, we focus on two-player zero-sum games with infinite strategy spaces. Drawing from the best arm identification literature, we design two algorithms with theoretical guarantees to learn maximin strategies in these games. The first one works in the fixed-confidence setting, guaranteeing the desired confidence level while minimizing the number of queries. Instead, the second algorithm fits the fixed-budget setting, maximizing the confidence without exceeding the given maximum number of queries. First, we formally prove delta-PAC theoretical guarantees for our algorithms under some regularity assumptions, which are encoded by letting the utility functions be drawn from a Gaussian process. Then, we experimentally evaluate our techniques on a testbed made of randomly generated games and instances representing simple real-world security settings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Successful Nash Equilibrium Agent for a Three-Player Imperfect-Information Game
Sam Ganzfried ... Austin Nowak
Games | VOL. 9
Sam Ganzfried, et. al.Sam Ganzfried ... Austin Nowak
08 Jun 2018
Games | VOL. 9

Polynomial stochastic games via sum of squares optimization
Parikshit Shah ... Pablo A Parrilo
-
Parikshit Shah, et. al.Parikshit Shah ... Pablo A Parrilo
01 Jan 2007
01 Jan 2007

The Evolution of Fuzzy Rules as Strategies in Two-Player Games
James E West ... Bruce Linster
Southern Economic Journal | VOL. 69
James E West, et. al.James E West ... Bruce Linster
01 Jan 2003
Southern Economic Journal | VOL. 69

Characterization and computation of correlated equilibria in infinite games
Noah D Stein ... Asuman Ozdaglar
-
Noah D Stein, et. al.Noah D Stein ... Asuman Ozdaglar
01 Jan 2007
01 Jan 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

Abstract

Talk to us

Similar Papers