Abstract

This paper studies reinforcement learning in which players base their action choices on valuations they hold for the actions. We identify two general conditions on valuation updating rules that together guarantee that the probability of playing a subgame perfect Nash equilibrium (SPNE) converges to one in games where no player is indifferent between two outcomes unless every other player is also indifferent. The same conditions guarantee that the fraction of times an SPNE is played converges to one almost surely. We also show that for additively separable valuations, in which valuations are the sum of an empirical term and an error term, the conditions guaranteeing convergence can be made more intuitive. In addition, we give four examples of valuations that satisfy our conditions. These examples represent different degrees of sophistication in learning behavior and include well-known examples of reinforcement learning.
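To make the notion of additively separable valuations concrete, the following is a minimal sketch, not the paper's construction: a single player repeatedly chooses among actions, valuing each as an empirical average of past payoffs plus a random error term that decays over time. The payoff values, decay rate, and error distribution are illustrative assumptions.

```python
import random

# Hypothetical stage payoffs for two actions (illustrative values only).
PAYOFFS = {"A": 1.0, "B": 0.6}
T = 2000  # number of rounds

totals = {"A": 0.0, "B": 0.0}  # cumulative payoff earned with each action
counts = {"A": 0, "B": 0}      # number of times each action was played


def valuation(action, t):
    """Additively separable valuation: empirical average payoff plus an
    error term that shrinks with time, so exploration fades as experience
    accumulates."""
    empirical = totals[action] / counts[action] if counts[action] else 0.0
    error = random.uniform(-1.0, 1.0) / (1 + t) ** 0.5
    return empirical + error


for t in range(T):
    # The player chooses the action with the highest current valuation.
    choice = max(PAYOFFS, key=lambda a: valuation(a, t))
    totals[choice] += PAYOFFS[choice]
    counts[choice] += 1

print("Share of plays of the higher-payoff action A:", counts["A"] / T)
```

Under these assumptions the error term vanishes and the empirical term tracks realized payoffs, so play concentrates on the better action, mirroring in a one-player setting the kind of convergence behavior the paper establishes for SPNE play in games.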
