Abstract

We present a novel exact algorithm called “value set iteration” (VSI) for solving two-person zero-sum Markov games (MGs), both as a generalization of value iteration (VI) and as a general framework for combining multiple solution methods. We introduce a novel operator on the value function space and apply it iteratively with an arbitrary sequence of policy sets, extending Chang’s VSI for MDPs to the MG setting. We show that VSI for MGs converges to the equilibrium value function at a rate that is at least linear, and we establish that VSI can improve the convergence speed, measured in the number of iterations, through a proper choice of the sequence of policy sets.
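As a concrete reference point, the sketch below implements standard value iteration for a zero-sum MG (Shapley’s algorithm) together with one plausible VSI-style variant. The operator shown, the pointwise maximum of the Shapley backup and the guaranteed values V^π of the policies π in the k-th policy set Π_k, is our own illustration inferred from the abstract; the paper’s exact operator may differ. All names (`matrix_game_value`, `shapley_backup`, `policy_value`, `vsi`) and the tensor layout (rewards R of shape (S, m, n), transitions P of shape (S, m, n, S)) are assumptions of this sketch, not the paper’s notation.

```python
import numpy as np
from scipy.optimize import linprog

def matrix_game_value(A):
    """Value of the zero-sum matrix game A (row player maximizes), via LP."""
    m, n = A.shape
    c = np.zeros(m + 1)
    c[-1] = -1.0                                  # minimize -v  <=>  maximize v
    A_ub = np.hstack([-A.T, np.ones((n, 1))])     # v <= x^T A e_j for every column j
    A_eq = np.hstack([np.ones((1, m)), [[0.0]]])  # x is a probability vector
    bounds = [(0.0, None)] * m + [(None, None)]   # x >= 0, v is free
    res = linprog(c, A_ub=A_ub, b_ub=np.zeros(n),
                  A_eq=A_eq, b_eq=[1.0], bounds=bounds)
    return res.x[-1]

def shapley_backup(V, R, P, gamma):
    """One sweep of Shapley's operator T: (T V)(s) = val(R[s] + gamma * P[s] @ V)."""
    return np.array([matrix_game_value(R[s] + gamma * P[s] @ V)
                     for s in range(R.shape[0])])

def policy_value(pi, R, P, gamma, tol=1e-10):
    """Lower bound V^pi: the value the row player guarantees by committing to the
    stationary policy pi (shape (S, m)) while the opponent best-responds.
    Computed by value iteration on the induced minimizing MDP."""
    Rpi = np.einsum('sm,smn->sn', pi, R)          # rewards faced by the opponent
    Ppi = np.einsum('sm,smnt->snt', pi, P)        # transitions faced by the opponent
    V = np.zeros(R.shape[0])
    while True:
        V_new = (Rpi + gamma * Ppi @ V).min(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new

def vsi(R, P, gamma, policy_sets, tol=1e-8, max_iter=10_000):
    """Illustrative VSI loop (an assumption, not the paper's exact operator):
    take the pointwise maximum of the Shapley backup and the guaranteed values
    of the row-player policies in the k-th set Pi_k = policy_sets(k)."""
    V = np.zeros(R.shape[0])
    for k in range(max_iter):
        V_new = shapley_backup(V, R, P, gamma)
        for pi in policy_sets(k):                 # with all sets empty: plain VI
            V_new = np.maximum(V_new, policy_value(pi, R, P, gamma))
        if np.max(np.abs(V_new - V)) < tol:
            break
        V = V_new
    return V
```

With every Π_k empty the loop reduces exactly to VI. Because each V^π lower-bounds the equilibrium value V*, the combined map in this sketch remains a γ-contraction with fixed point V*, so supplying good candidate policies cannot slow convergence and can reduce the number of iterations, mirroring the abstract’s claim.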
