Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains

Rolando Cavazos-Cadena

doi:10.1016/0167-6911(88)90043-6

Journal: Systems & Control Letters
Publication Date: Jan 1, 1988
Citations: 21

Affiliation: Universidad Autónoma Agraria Antonio Narro

Abstract

We consider average reward Markov decision processes with a discrete time parameter and a denumerable state space. We are concerned with the following problem: find necessary and sufficient conditions under which, for an arbitrary bounded reward function, the corresponding average reward optimality equation has a bounded solution. This problem is solved for a class of systems that includes the case in which, under the action of any stationary policy, the state space is an irreducible positive recurrent class.
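
For orientation, the average reward optimality equation mentioned in the abstract is commonly written in the form below. The notation is an assumption made here for illustration and is not taken from the paper: S is the denumerable state space, A(x) the set of actions admissible in state x, r the bounded reward function, p the transition law, g a constant, and h a real-valued function on S.

```latex
% Standard form of the average reward optimality equation (notation assumed, not the paper's)
g + h(x) \;=\; \sup_{a \in A(x)} \Big[\, r(x,a) + \sum_{y \in S} p(y \mid x, a)\, h(y) \Big],
\qquad x \in S .
```

In this notation, a bounded solution is a pair (g, h) satisfying the equation with \sup_{x \in S} |h(x)| < \infty.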
