Abstract
We present a novel simulation-based algorithm for approximately solving Markov Decision Processes (MDPs) under the infinite-horizon discounted reward criterion. The algorithm extends the well-known policy iteration algorithm by combining multi-policy improvement with a distributed, simulation-based voting scheme for policy evaluation, and we analyze its performance relative to the optimal value.
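As a point of reference for the extension described above, a minimal sketch of the classic policy iteration baseline is shown below; the toy two-state, two-action MDP (`P`, `R`, `gamma`) is hypothetical and not taken from the paper, and the exact linear-system policy evaluation here is what the paper's simulation-based voting evaluation would replace.

```python
import numpy as np

# Hypothetical toy MDP: 2 states, 2 actions (illustrative only).
P = np.array([  # P[s, a, s'] transition probabilities
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.1, 0.9]],
])
R = np.array([[1.0, 0.0], [0.0, 2.0]])  # R[s, a] expected one-step reward
gamma = 0.9  # discount factor for the infinite-horizon criterion

def policy_iteration(P, R, gamma):
    n_states, n_actions = R.shape
    pi = np.zeros(n_states, dtype=int)  # start from an arbitrary policy
    while True:
        # Policy evaluation: solve (I - gamma * P_pi) V = R_pi exactly.
        # (The paper's algorithm replaces this step with distributed
        # simulation-based voting.)
        P_pi = P[np.arange(n_states), pi]
        R_pi = R[np.arange(n_states), pi]
        V = np.linalg.solve(np.eye(n_states) - gamma * P_pi, R_pi)
        # Policy improvement: act greedily with respect to Q(s, a).
        Q = R + gamma * P @ V
        new_pi = Q.argmax(axis=1)
        if np.array_equal(new_pi, pi):
            return pi, V  # converged: pi is optimal for this MDP
        pi = new_pi

pi_star, V_star = policy_iteration(P, R, gamma)
```

Standard policy iteration converges in finitely many iterations for a finite MDP; the simulation-based variant trades this exactness for applicability when the transition model is only accessible through sampling.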