Abstract
We propose a new algorithm for two-person zero-sum undiscounted stochastic games, focusing on stationary strategies. Given a positive real $$\varepsilon$$, we call a stochastic game $$\varepsilon$$-ergodic if its values from any two initial positions differ by at most $$\varepsilon$$. For every $$\varepsilon > 0$$, the algorithm terminates in finite time and either outputs a pair of stationary strategies for the two players guaranteeing that the values from all initial positions lie within an $$\varepsilon$$-range, or identifies two initial positions $$u$$ and $$v$$, together with corresponding stationary strategies for the players, proving that the game values starting from $$u$$ and $$v$$ are at least $$\varepsilon/24$$ apart. In particular, this shows that if a stochastic game is $$\varepsilon$$-ergodic, then there are stationary strategies for the players proving $$24\varepsilon$$-ergodicity. This result strengthens, and provides a constructive version of, an existential result by Vrieze (Stochastic games with finite state and action spaces. PhD thesis, Centrum voor Wiskunde en Informatica, Amsterdam, 1980), which states that if a stochastic game is 0-ergodic, then there are $$\varepsilon$$-optimal stationary strategies for every $$\varepsilon > 0$$. The algorithm is based on a potential transformation technique that changes the range of local values at all positions without changing the normal form of the game.
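The abstract does not spell out the potential transformation, so the following is only a minimal sketch of the idea under stated assumptions. It assumes both players have already fixed stationary strategies, so that play reduces to an ergodic Markov chain with transition matrix `P` and expected local rewards `r`, and it assumes the commonly used form of the shift, $$r_x(v) = r(v) + x(v) - \sum_u P(v,u)\,x(u)$$ for a potential vector $$x$$. The names `transform_rewards`, `stationary_distribution`, and `mean_payoff`, and all numbers, are hypothetical illustrations and are not taken from the paper.

```python
def transform_rewards(P, r, x):
    """Expected local rewards after shifting by the potential x (assumed form).

    r_x[v] = r[v] + x[v] - sum_u P[v][u] * x[u]
    """
    n = len(P)
    return [r[v] + x[v] - sum(P[v][u] * x[u] for u in range(n)) for v in range(n)]


def stationary_distribution(P, iterations=10_000):
    """Approximate the stationary distribution of an ergodic chain by power iteration."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iterations):
        pi = [sum(pi[v] * P[v][u] for v in range(n)) for u in range(n)]
    return pi


def mean_payoff(P, r):
    """Long-run average reward of an ergodic chain (same from every start position)."""
    pi = stationary_distribution(P)
    return sum(p * rv for p, rv in zip(pi, r))


# Hypothetical 3-position chain induced by some fixed pair of stationary strategies.
P = [[0.5, 0.5, 0.0],
     [0.2, 0.3, 0.5],
     [0.4, 0.0, 0.6]]
r = [4.0, -1.0, 2.0]      # expected local rewards before the shift
x = [0.0, 3.0, -2.0]      # an arbitrary illustrative potential vector

r_x = transform_rewards(P, r, x)

# The shift changes how spread out the local rewards are...
spread_before = max(r) - min(r)
spread_after = max(r_x) - min(r_x)

# ...but not the long-run average payoff, because the potential terms telescope.
payoff_before = mean_payoff(P, r)
payoff_after = mean_payoff(P, r_x)

print(spread_before, spread_after)
print(payoff_before, payoff_after)
```

In this toy chain the spread of the local rewards shrinks while the mean payoff is unchanged, which mirrors the abstract's statement that the transformation changes the range of local values without changing the game. The full algorithm works with local matrix games at each position rather than a single fixed chain, which this sketch does not attempt to reproduce.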