Myopic Best-Response Learning in Large-Scale Games

Brian Swenson

doi:10.1184/r1/6720788.v1

Abstract

This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games with many players. We focus our study on a set of learning dynamics in which agents seek to myopically optimize their next-stage utility given some forecast of opponent behavior; i.e., players act according to myopic best response dynamics. The prototypical algorithm in this class is the well-known fictitious play (FP) algorithm. FP dynamics are intuitively simple and can be seen as the \natural learning dynamics associated with the Nash equilibrium concept. Accordingly, FP has received extensive study over the years and has been used in a variety of applications. Our contributions may be divided into two main research areas. First, we study fundamental properties of myopic best response (MBR) dynamics in large-scale games. We have three main contributions in this area. (i) We characterize the robustness of MBR dynamics to a class of perturbations common in real-world applications. (ii) We study FP dynamics in the important class of large-scale games known as potential games. We show that for almost all potential games and for almost all initial conditions, FP converges to a pure-strategy (deterministic) equilibrium. (iii) We develop tools to characterize the rate of convergence of MBR algorithms in potential games. In particular, we show that the rate of convergence of FP is \almost always exponential in potential games. Our second research focus concerns implementation of MBR learning dynamics in large-scale games. MBR dynamics can be shown, theoretically, to converge to equilibrium strategies in important classes of large-scale games (e.g., potential games). However, despite theoretical convergence guarantees, MBR dynamics can be extremely impractical to implement in large games due to demanding requirements in terms of computational capacity, information overhead, communication infrastructure, and global synchronization. Using the aforementioned robustness result, we study practical methods to mitigate each of these issues. We place a special emphasis on studying algorithms that may be implemented in a network-based setting, i.e., a setting in which inter-agent communication is restricted to a (possibly sparse) overlaid communication graph. Within the network-based setting, we also study the use of so-called \inertia in MBR algorithms as a tool for learning pure-strategy NE.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Myopic Best-Response Learning in Large-Scale Games

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Empirical Centroid Fictitious Play: An Approach for Distributed Learning in Multi-Agent Games
Brian Swenson ... Joao Xavier
IEEE Transactions on Signal Processing | VOL. 63
Brian Swenson, et. al.Brian Swenson ... Joao Xavier
16 Apr 2013
IEEE Transactions on Signal Processing | VOL. 63

On the exponential rate of convergence of fictitious play in potential games
Brian Swenson ... Soummya Kar
-
Brian Swenson, et. al.Brian Swenson ... Soummya Kar
01 Oct 2017
01 Oct 2017

Joint Strategy Fictitious Play with Inertia for Potential Games
J.R Marden ... G Arslan
-
J.R Marden, et. al.J.R Marden ... G Arslan
12 Dec 2005
12 Dec 2005

On robustness properties in Empirical Centroid Fictitious Play
Brian Swensony ... Joao Xavier
-
Brian Swensony, et. al.Brian Swensony ... Joao Xavier
01 Dec 2015
01 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Myopic Best-Response Learning in Large-Scale Games

Abstract

Talk to us

Similar Papers