Recurrence Conditions for Average and Blackwell Optimality in Denumerable State Markov Decision Chains

Arie Hordijk, Rommert Dekker

doi:10.1287/moor.17.2.271

Open Access

Journal: Mathematics of Operations Research
Publication Date: Jan 1, 1992
Citations: 33

Affiliation: University of Applied Sciences Leiden, Shell (Netherlands)

#Blackwell Optimality #Blackwell Optimal Policies

Abstract

In a previous paper, Dekker and Hordijk (1988) presented an operator-theoretical approach for multichain Markov decision processes with a countable state space, compact action sets, and unbounded rewards. Conditions were presented guaranteeing the existence of a Laurent series expansion for the discounted rewards, the existence of average and Blackwell optimal policies, and the existence of solutions for the average and Blackwell optimality equations. While these assumptions were operator-oriented and formulated as conditions for the deviation matrix, we will show in this paper that the same approach can also be carried out under recurrence conditions. These new conditions seem easier to check in general and are especially suited for applications in queueing models.
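
For orientation, the Laurent series expansion of the discounted rewards referred to in the abstract has the following familiar form in the finite-state setting. This is a sketch under assumed notation, not the paper's own statement: the symbols P (transition matrix of a fixed stationary policy), r (reward vector), P^* (stationary matrix, the Cesàro limit of P^n), D (deviation matrix), and \rho (interest rate, with discount factor \alpha = 1/(1+\rho)) are introduced here for illustration. With a denumerable state space and unbounded rewards, the expansion is not automatic and needs conditions of the kind the paper studies.

```latex
% Assumed notation (not taken from the paper):
%   P      transition matrix of a fixed stationary policy
%   r      one-step reward vector
%   P^*    stationary matrix (Cesaro limit of P^n)
%   D      deviation matrix
%   \rho   interest rate, \alpha = 1/(1+\rho) the discount factor
\[
  D = (I - P + P^{*})^{-1} - P^{*},
\]
\[
  v^{\alpha}
  = \sum_{n=0}^{\infty} \alpha^{n} P^{n} r
  = (1+\rho)\left[ \frac{1}{\rho}\, P^{*} r
      + \sum_{n=0}^{\infty} (-\rho)^{n} D^{\,n+1} r \right],
  \qquad 0 < \rho < \rho_{0},
\]
% for some sufficiently small \rho_0 > 0.
```

In this finite-state form, the leading coefficient P^* r is the average reward and D r is the bias. A Blackwell optimal policy is one that is discount optimal for every interest rate \rho sufficiently close to 0.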
