Are limits of α-discounted optimal policies Blackwell optimal? A counterexample

Arie Hordijk, Flos Spieksma

doi:10.1016/0167-6911(89)90018-2

Journal: Systems & Control Letters
Publication Date: Jul 1, 1989
Citations: 2

Affiliation: Leiden University

#Compact Action Sets #Finite Action Sets

Abstract

In Markov Decision Chains (MDCs) with a finite state space and finite action sets, it is well known that the limits of α-discounted optimal policies, as α tends to 1, are Blackwell optimal. A recent paper by Cavazos-Cadena and Lasserre conjectured that this property of limiting policies also holds for unichain MDCs. We disprove this conjecture by constructing a non-Blackwell limiting policy in a unichain MDC with finitely many states and compact action sets.
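
For readers unfamiliar with the terms, the notions in the abstract can be sketched as follows. The notation is an assumption made here for illustration and is not taken from the paper: r is a one-step reward (the paper may instead work with costs), X_t and A_t are the state and action at time t, and f_α denotes a stationary deterministic policy that is optimal for discount factor α. The definitions themselves are the standard textbook ones.

```latex
% Assumed notation (not from the paper): rewards r, state X_t, action A_t.
% Expected alpha-discounted reward of policy R from initial state i:
\[
  V_\alpha(i, R) \;=\; \mathbb{E}_{i,R}\!\left[\sum_{t=0}^{\infty} \alpha^{t}\, r(X_t, A_t)\right],
  \qquad 0 \le \alpha < 1 .
\]
% R is alpha-discounted optimal if it attains the supremum in every state:
\[
  V_\alpha(i, R) \;=\; \sup_{R'} V_\alpha(i, R') \quad \text{for all states } i .
\]
% f is Blackwell optimal if it is alpha-discounted optimal for all alpha
% sufficiently close to 1:
\[
  \exists\, \alpha_0 < 1 : \quad
  V_\alpha(i, f) \;=\; \sup_{R'} V_\alpha(i, R')
  \quad \text{for all states } i \text{ and all } \alpha \in [\alpha_0, 1) .
\]
% f is a limiting policy if, for some sequence alpha_n increasing to 1 and
% alpha_n-discounted optimal policies f_{alpha_n}:
\[
  f(i) \;=\; \lim_{n \to \infty} f_{\alpha_n}(i) \quad \text{for all states } i .
\]
```

With finite action sets, a convergent sequence of policies is eventually constant, so a limiting policy coincides with f_{α_n} for all large n. With compact action sets the actions f_{α_n}(i) may approach their limit without ever reaching it, which is the setting of the counterexample.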
