Imperfect Information Game Research Articles

The leading approach to solving large imperfect information games is to pre-calculate an approximate solution using a simplified abstraction of the full game; that solution is then used to play the original, full-scale game. The abstraction step is necessitated by the size of the game tree. However, as the original game progresses, the remaining portion of the tree (the subgame) becomes smaller. An appealing idea is to use the simplified abstraction to play the early parts of the game and then, once the subgame becomes tractable, to calculate a solution using a finer-grained abstraction in real time, creating a combined final strategy. While this approach is straightforward for perfect information games, it is a much more complex problem for imperfect information games. If the subgame is solved locally, the opponent can alter his play prior to this subgame to exploit our combined strategy. To prevent this, we introduce the notion of subgame margin, a simple value with appealing properties. If any best response reaches the subgame, the improvement in exploitability of the combined strategy is (at least) proportional to the subgame margin. This motivates subgame refinements resulting in large positive margins. Unfortunately, current techniques either neglect subgame margin (potentially leading to a large negative subgame margin and drastically more exploitable strategies), or guarantee only non-negative subgame margin (possibly producing the original, unrefined strategy, even if much stronger strategies are possible). Our technique remedies this problem by maximizing the subgame margin and is guaranteed to find the optimal solution. We evaluate our technique using one of the top participants of the AAAI-14 Computer Poker Competition, the leading playground for agents in imperfect information settings.


We consider the problem of playing a repeated two-player zero-sum game safely: that is, guaranteeing at least the value of the game per period in expectation regardless of the strategy used by the opponent. Playing a stage-game equilibrium strategy at each time step clearly guarantees safety, and prior work has (incorrectly) stated that it is impossible to simultaneously deviate from a stage-game equilibrium (in hope of exploiting a suboptimal opponent) and to guarantee safety. We show that such profitable deviations are indeed possible, specifically in games where certain types of “gift” strategies exist, which we define formally. We show that the set of strategies constituting such gifts can be strictly larger than the set of iteratively weakly-dominated strategies; this disproves another recent assertion, which states that all non-iteratively weakly dominated strategies are best responses to each equilibrium strategy of the other player. We present a full characterization of safe strategies, and develop efficient algorithms for exploiting suboptimal opponents while guaranteeing safety. We also provide analogous results for extensive-form games of perfect and imperfect information, and present safe exploitation algorithms and full characterizations of safe strategies for those settings as well. We present experimental results in Kuhn poker, a canonical test problem for game-theoretic algorithms. Our experiments show that (1) aggressive safe exploitation strategies significantly outperform adjusting the exploitation within stage-game equilibrium strategies only and (2) all the safe exploitation strategies significantly outperform a (nonsafe) best response strategy against strong dynamic opponents.
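The safety notion in this abstract can be illustrated with a small matrix game. The sketch below is only one simple reading of the idea, not the algorithm from the paper: the payoff matrix, the function names, and the "banked surplus" budget are all hypothetical. It assumes a deviation is acceptable when its worst-case shortfall below the game value is covered by expected surplus already collected from an opponent's gift move.

```python
def worst_case_payoff(payoffs, row_strategy):
    """Lowest expected payoff the row player can get against any pure column."""
    n_cols = len(payoffs[0])
    return min(
        sum(prob * payoffs[row][col] for row, prob in enumerate(row_strategy))
        for col in range(n_cols)
    )


def is_safe_deviation(payoffs, game_value, row_strategy, banked_surplus):
    """Hypothetical check: the worst-case loss relative to the game value
    must not exceed the expected surplus banked in earlier rounds."""
    shortfall = game_value - worst_case_payoff(payoffs, row_strategy)
    return shortfall <= banked_surplus


# Illustrative game (an assumption, not taken from the paper): matching
# pennies for the row player, plus a third column that hands the row
# player 2 no matter what. The game value is 0, so that column is a gift.
PAYOFFS = [
    [1, -1, 2],
    [-1, 1, 2],
]
GAME_VALUE = 0

equilibrium = [0.5, 0.5]  # guarantees the game value against every column
pure_up = [1.0, 0.0]      # exploitative, but can lose 1 in the worst case

if __name__ == "__main__":
    print(worst_case_payoff(PAYOFFS, equilibrium))
    print(is_safe_deviation(PAYOFFS, GAME_VALUE, pure_up, banked_surplus=0))
    print(is_safe_deviation(PAYOFFS, GAME_VALUE, pure_up, banked_surplus=2))
```

In this toy game the pure strategy is rejected while nothing has been banked, and accepted once the opponent has played the gift column once, since the surplus of 2 covers the worst-case loss of 1.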


Articles published on Imperfect Information Game

Webjet’s Competitive Strategy from a Game Theory Perspective

Strategies in a Stochastic Pursuer-Evader Game

Linking poverty and the environment: participatory community-based approaches, games of imperfect information and the Convention on Biological Diversity

Dynamic Thresholding and Pruning for Regret Minimization

The Efficiency of the HyperPlay Technique Over Random Sampling

Doomsday equilibria for omega-regular games

A model of scholarly publishing with hybrid academic journals

A Decision Making Method Based on Society of Mind Theory in Multi-Player Imperfect Information Games

Refining Subgames in Large Imperfect Information Games

Optimal Relevance in Imperfect Information Games

A work point count system coupled with back-propagation for solving double dummy bridge problem

Safe Opponent Exploitation

Uniform strategies, rational relations and jumping automata

Tartanian7: A Champion Two-Player No-Limit Texas Hold'em Poker-Playing Program

Setting up charging electric stations within residential communities in current China: Gaming of government agencies and property management companies

Using Modified UCT Algorithm Basing on Risk Estimation Methods in Imperfect Information Games

Alternative Selection Functions for Information Set Monte Carlo Tree Search

Modified UCT Algorithm with Risk Dominance Methods in Imperfect Information Game

Solving Imperfect Information Games Using Decomposition

Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games
