Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards

Arie Hordijk,Alexander A. Yushkevich

doi:10.1007/s001860050079

Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards

Arie Hordijk, Alexander A. Yushkevich

https://doi.org/10.1007/s001860050079

Copy DOI

Journal: Mathematical Methods of Operations Research (ZOR)	Publication Date: Dec 14, 1999
Citations: 41

Affiliation: Leiden University

#Blackwell Optimal #Markov Decision Chains + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper is the second part of our study of Blackwell optimal policies in Markov decision chains with a Borel state space and unbounded rewards. We prove that a stationary policy is Blackwell optimal in the class of all history-dependent policies if it is Blackwell optimal in the class of stationary policies.

Full Text