Abstract
AbstractIn an ε-Nash equilibrium, a player can gain at most ε by unilaterally changing his behaviour. For two-player (bimatrix) games with payoffs in [0,1], the best-known ε achievable in polynomial time is 0.3393 [23]. In general, for n-player games an ε-Nash equilibrium can be computed in polynomial time for an ε that is an increasing function of n but does not depend on the number of strategies of the players. For three-player and four-player games the corresponding values of \({\epsilon} \) are 0.6022 and 0.7153, respectively. Polymatrix games are a restriction of general n-player games where a player’s payoff is the sum of payoffs from a number of bimatrix games. There exists a very small but constant \({\epsilon} \) such that computing an \({\epsilon} \)-Nash equilibrium of a polymatrix game is PPAD-hard. Our main result is that an (0.5 + δ)-Nash equilibrium of an n-player polymatrix game can be computed in time polynomial in the input size and \(\frac{1}{\delta}\). Inspired by the algorithm of Tsaknakis and Spirakis [23], our algorithm uses gradient descent on the maximum regret of the players.KeywordsNash EquilibriumGradient DescentSteep DescentApproximation GuaranteeApproximate EquilibriumThese keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Submitted Version (Free)
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have