Denumerable controlled Markov chains with average reward criterion: Sample path optimality

Rolando Cavazos-Cadena,Emmanuel Fern�Ndez-Gaucherand

doi:10.1007/bf01415067

Denumerable controlled Markov chains with average reward criterion: Sample path optimality

Rolando Cavazos-Cadena, Emmanuel Fern�Ndez-Gaucherand

https://doi.org/10.1007/bf01415067

Copy DOI

Journal: Mathematical methods of operations research (Heidelberg, Germany)	Publication Date: Feb 1, 1995
Citations: 22

Affiliation: Universidad Autónoma Agraria Antonio Narro, University of Arizona

#Average Reward Optimality #Sample Path + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We consider discrete-time nonlinear controlled stochastic systems, modeled by controlled Makov chains with denumerable state space and compact action space. The corresponding stochastic control problem of maximizing average rewards in the long-run is studied. Departing from the most common position which usesexpected values of rewards, we focus on a sample path analysis of the stream of states/rewards. Under a Lyapunov function condition, we show that stationary policies obtained from the average reward optimality equation are not only average reward optimal, but indeed sample path average reward optimal, for almost all sample paths.

Full Text