Deviation Matrix, Laurent Series and Blackwell Optimality in Countable State Markov Decision Processes

Yoshinobu Kadota

doi:10.1080/02331930211989

Deviation Matrix, Laurent Series and Blackwell Optimality in Countable State Markov Decision Processes

Yoshinobu Kadota

https://doi.org/10.1080/02331930211989

Copy DOI

Journal: Optimization

Publication Date: Jan 1, 2002

Affiliation: Wakayama University

#Laurent Series Expansion #Countable State Space + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper presents a recurrent condition on Markov decision processes with a countable state space and bounded rewards. The condition is sufficient for the existence of a Blackwell optimal stationary policy, having the Laurent series expansion with continuous coefficients. It is so relaxed that the Markov chain corresponding to a stationary policy may have countably many periodic recurrent classes. Our method finds the deviation matrix in an explicit form.

Full Text