A counterexample on the optimality equation in Markov decision chains with the average cost criterion

Rolando Cavazos-Cadena

doi:10.1016/0167-6911(91)90060-r

A counterexample on the optimality equation in Markov decision chains with the average cost criterion

Rolando Cavazos-Cadena

https://doi.org/10.1016/0167-6911(91)90060-r

Copy DOI

Journal: Systems & Control Letters	Publication Date: May 1, 1991
Citations: 43

Affiliation: Universidad Autónoma Agraria Antonio Narro

#Denumerable State Space #Markov Decision Chains + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We consider Markov decision processes with denumerable state space and finite control sets; the performance index of a control policy is a long-run expected average cost criterion and the cost function is bounded below. For these models, the existence of average optimal stationary policies was recently established in [11] under very general assumptions. Such a result was obtained via an optimality inequality. Here, we use a simple example to prove that the conditions in [11] do not imply the existence of a solution to the average cost optimality equation.

Full Text