A policy iteration algorithm for zero-sum stochastic games with mean payoff

Jean Cochet-Terrasson,Stéphane Gaubert

doi:10.1016/j.crma.2006.07.011

A policy iteration algorithm for zero-sum stochastic games with mean payoff

Jean Cochet-Terrasson, Stéphane Gaubert

Open Access

https://doi.org/10.1016/j.crma.2006.07.011

Copy DOI

Journal: Comptes Rendus. Mathématique	Publication Date: Aug 10, 2006
Citations: 16	License type: mit

Affiliation: French Institute for Research in Computer Science and Automation

#Zero-sum Stochastic Games #Policy Iteration Algorithm + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We give a policy iteration algorithm to solve zero-sum stochastic games with finite state and action spaces and perfect information, when the value is defined in terms of the mean payoff per turn. This algorithm does not require any irreducibility assumption on the Markov chains determined by the strategies of the players. It is based on a discrete nonlinear analogue of the notion of reduction of a super-harmonic function. To cite this article: J. Cochet-Terrasson, S. Gaubert, C. R. Acad. Sci. Paris, Ser. I 343 (2006).

Full Text