Mirror descent learning in continuous games

Zhengyuan Zhou,Panayotis Mertikopoulos,Nicholas Bambos,Peter Glynn,Aris L Moustakas

doi:10.1109/cdc.2017.8264532

Abstract

Online Mirror Descent (OMD) is an important and widely used class of adaptive learning algorithms that enjoys good regret performance guarantees. It is therefore natural to study the evolution of the joint action in a multi-agent decision process (typically modeled as a repeated game) where every agent employs an OMD algorithm. This well-motivated question has received much attention in the literature that lies at the intersection between learning and games. However, much of the existing literature has been focused on the time average of the joint iterates. In this paper, we tackle a harder problem that is of practical utility, particularly in the online decision making setting: the convergence of the last iterate when all the agents make decisions according to OMD. We introduce an equilibrium stability notion called variational stability (VS) and show that in variationally stable games, the last iterate of OMD converges to the set of Nash equilibria. We also extend the OMD learning dynamics to a more general setting where the exact gradient is not available and show that the last iterate (now random) of OMD converges to the set of Nash equilibria almost surely.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mirror descent learning in continuous games

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Scaling Mean Field Games with Online Mirror Descent

-

20 Apr 2022
20 Apr 2022

Scaling Mean Field Games with Online Mirror Descent

-

20 Apr 2022
20 Apr 2022

Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent
Gabriele Farina ... Tuomas Sandholm
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Gabriele Farina, et. al.Gabriele Farina ... Tuomas Sandholm
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

A continuous-time approach to online optimization
Joon Kwon ... Panayotis Mertikopoulos
Journal of Dynamics & Games | VOL. 4
Joon Kwon, et. al.Joon Kwon ... Panayotis Mertikopoulos
16 Oct 2016
Journal of Dynamics & Games | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mirror descent learning in continuous games

Abstract

Talk to us

Similar Papers