A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion

Rolando Cavazos-Cadena, Karel Sladký, Raúl Montes-De-Oca

doi:10.1007/s10957-013-0474-6

Journal: Journal of Optimization Theory and Applications
Publication Date: Nov 23, 2013
Citations: 5

Affiliations: Universidad Autónoma Agraria Antonio Narro; Czech Academy of Sciences, Institute of Information Theory and Automation; Universidad Autónoma Metropolitana

Keywords: Lyapunov Function Condition, Denumerable State Space

Abstract

This note deals with Markov decision chains evolving on a denumerable state space. Under standard continuity-compactness requirements, an explicit example is provided to show that, with respect to a strong sample-path average reward criterion, the Lyapunov function condition does not ensure the existence of an optimal stationary policy.
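For orientation, the two notions named in the abstract can be written out in a commonly used form. This is a sketch in standard notation, not a transcription from the note: the symbols S (state space), R (reward function), p_{xy}(a) (transition law), ℓ (Lyapunov function), z (distinguished state) and g (optimal average reward) are assumptions introduced here, and the note's exact definitions may differ in detail.

```latex
% Assumed notation (not from the abstract): S is the denumerable state space,
% R(x,a) the reward, p_{xy}(a) the transition law, (X_t, A_t) the
% state-action process, and P_x^{\pi} the law under policy \pi from state x.

% Lyapunov function condition, core inequality in a commonly stated form:
% there exist \ell : S \to [0,\infty) and a state z \in S such that
\[
  1 + |R(x,a)| + \sum_{y \neq z} p_{xy}(a)\,\ell(y) \;\le\; \ell(x)
  \qquad \text{for every state } x \text{ and admissible action } a ,
\]
% usually accompanied by continuity and integrability requirements on \ell.

% Sample-path average optimality in a strong sense, one common formulation:
% a policy \pi^* is optimal with value g if, for every initial state x,
\[
  \lim_{n\to\infty} \frac{1}{n} \sum_{t=0}^{n-1} R(X_t, A_t) = g
  \quad P_x^{\pi^*}\text{-a.s.},
\]
% while for every policy \pi
\[
  \limsup_{n\to\infty} \frac{1}{n} \sum_{t=0}^{n-1} R(X_t, A_t) \le g
  \quad P_x^{\pi}\text{-a.s.}
\]
```

Read this way, the criterion compares time averages along almost every trajectory rather than in expectation, which is a more demanding requirement than the usual expected average reward criterion.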

Full Text