A new policy iteration scheme for Markov decision processes using Schweitzer's formula

J B Lasserre

doi:10.2307/3215254

A new policy iteration scheme for Markov decision processes using Schweitzer's formula

J B Lasserre

https://doi.org/10.2307/3215254

Copy DOI

Journal: Journal of Applied Probability	Publication Date: Mar 1, 1994
Citations: 3

#Family Of Markov Chains #Scheme For Processes + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Given a family of Markov chains with a single recurrent class, we present a potential application of Schweitzer's exact formula relating the steady-state probability and fundamental matrices of any two chains in the family. We propose a new policy iteration scheme for Markov decision processes where in contrast to policy iteration, the new criterion for selecting an action ensures the maximal one-step average cost improvement. Its computational complexity and storage requirement are analysed.

Full Text