Abstract

We consider a dynamic approach to games in extensive forms. By restricting the convertibility relation over strategy profiles, we obtain a semi-potential (in the sense of Kukushkin), and we show that in finite sequential games the corresponding restriction of better-response dynamics will converge to a Nash equilibrium in quadratic time. Convergence happens on a per-player basis, and even in the presence of players with cyclic preferences, the players with acyclic preferences will stabilize. Thus, we obtain a candidate notion for rationality in the presence of irrational agents. Moreover, the restriction of convertibility can be justified by a conservative updating of beliefs about the other players strategies. For infinite games in extensive form we can retain convergence to a Nash equilibrium (in some sense), if the preferences are given by continuous payoff functions; or obtain a transfinite convergence if the outcome sets of the game are Delta ^0_2-sets.

Highlights

  • The Nash equilibria are the fixed points of the better response dynamics. In graph theory they would be called the sinks of these dynamics, and in computer science they may be called their terminal strategy profiles

  • – In a finite game with acyclic preferences, the dynamics stabilizes at a Nash equilibrium after a quadratic number of steps

  • A strategy profile s in S := a∈A Sa is a Nash equilibrium if it makes every Player a stable, i.e. v(s) ⊀a v(s ) for all s ∈ S that differ from s at most at the a-component

Read more

Summary

Introduction

The Nash equilibria are the fixed points of the better (or best) response dynamics In graph theory they would be called the sinks of these dynamics, and in computer science they may be called their terminal strategy profiles. Better-response dynamics will always improve the potential, and terminates (at a Nash equilibrium) if the game is finite. – In a finite game with acyclic preferences, the dynamics stabilizes at a Nash equilibrium after a quadratic number of steps. The main advantage of our two proofs of termination is that they work on a per-player basis: Each player with acyclic preference will terminate, even in the presence of players with cyclic preferences This is far from obvious, as in most dynamics a single player who keeps altering their choices can induce the other players to keep changing, too. Outcome sets, and show that transfinite continuation of the dynamics still stabilizes

Related Work
Background and Notation
Defining Lazy Improvement
Termination in Finite Games
Lazy Non-worsening
Lazy Convertibility as Belief Updating
Lazy Improvement in Infinite Games
Deepening Lazy Improvement
Fair Lazy Improvement
Transfinite Lazy Improvement in the Difference Hierarchy
Some Counterexamples

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.