Generic uniqueness of the bias vector of finite zero-sum stochastic games with perfect information

Marianne Akian, Stéphane Gaubert, Antoine Hochart

doi:10.1016/j.jmaa.2017.07.017

Abstract

Mean-payoff zero-sum stochastic games can be studied by means of a nonlinear spectral problem. When the state space is finite, the latter consists in finding an eigenpair $(u,\lambda)$ solution of $T(u) = \lambda e + u$, where $T:\mathbb{R}^n \to \mathbb{R}^n$ is the Shapley (or dynamic programming) operator, $\lambda$ is a scalar, $e$ is the unit vector, and $u \in \mathbb{R}^n$. The scalar $\lambda$ yields the mean payoff per time unit, and the vector $u$, called bias, allows one to determine optimal stationary strategies in the mean-payoff game. The existence of the eigenpair $(u,\lambda)$ is generally related to ergodicity conditions. A basic issue is to understand for which classes of games the bias vector is unique (up to an additive constant). In this paper, we consider perfect-information zero-sum stochastic games with finite state and action spaces, thinking of the transition payments as variable parameters, transition probabilities being fixed. We show that the bias vector, thought of as a function of the transition payments, is generically unique (up to an additive constant). The proof uses techniques of nonlinear Perron–Frobenius theory. As an application of our results, we obtain an explicit perturbation scheme allowing one to solve degenerate instances of stochastic games by policy iteration.
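
To make the spectral problem concrete, here is a minimal Python sketch that computes an eigenpair for a hypothetical two-state perfect-information game. The game data (`GAME`), the function names, and the use of relative value iteration are all assumptions made for illustration; the paper itself concerns policy iteration and a perturbation scheme, which this sketch does not implement. The toy game has strictly positive transition probabilities, so the iteration is expected to converge; degenerate instances may behave differently.

```python
# Hypothetical toy game (not from the paper): each state lists the player who
# moves there and the available actions as (payment, transition probabilities).
GAME = [
    ("max", [(1.0, [0.5, 0.5]), (3.0, [0.8, 0.2])]),
    ("min", [(2.0, [0.5, 0.5]), (5.0, [0.2, 0.8])]),
]


def shapley(game, u):
    """Apply the Shapley operator T: the controlling player optimises r + p.u."""
    out = []
    for player, actions in game:
        values = [r + sum(pj * uj for pj, uj in zip(p, u)) for r, p in actions]
        out.append(max(values) if player == "max" else min(values))
    return out


def solve_eigenpair(game, tol=1e-12, max_iter=10_000):
    """Relative value iteration: u <- T(u) - T(u)[0] * e, which keeps u[0] = 0."""
    u = [0.0] * len(game)
    for _ in range(max_iter):
        tu = shapley(game, u)
        lam = tu[0]  # candidate mean payoff, since u[0] is normalised to 0
        new_u = [x - lam for x in tu]
        if max(abs(a - b) for a, b in zip(new_u, u)) < tol:
            return new_u, lam
        u = new_u
    raise RuntimeError("relative value iteration did not converge")


u, lam = solve_eigenpair(GAME)
# Residual of the eigen-equation T(u) = lam * e + u, checked componentwise.
residual = max(abs(t - lam - x) for t, x in zip(shapley(GAME, u), u))
print(lam, u, residual)
```

The bias is normalised by fixing its first coordinate to zero, which selects one representative of the class of vectors that differ by an additive constant. For this toy instance a hand computation gives λ = 19/7 and u = (0, −10/7).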

Journal: Journal of Mathematical Analysis and Applications
Publication Date: Jul 24, 2017
Citations: 21
