Semi-Markov decision processes with polynomial reward

Zvi Rosberg

doi:10.2307/3213482

Journal: Journal of Applied Probability
Publication Date: Jun 1, 1982
Citations: 6

Affiliation: Technion – Israel Institute of Technology

#Semi-Markov Decision Processes #Ergodicity Assumption

Abstract

A semi-Markov decision process, with a denumerable multidimensional state space, is considered. At any given state only a finite number of actions can be taken to control the process. The immediate reward earned in one transition period is merely assumed to be bounded by a polynomial and a bound is imposed on a weighted moment of the next state reached in one transition. It is shown that under an ergodicity assumption there is a stationary optimal policy for the long-run average reward criterion. A queueing network scheduling problem, for which previous criteria are inapplicable, is given as an application.
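As a rough illustration of the long-run average reward criterion mentioned above, the sketch below evaluates one fixed stationary policy on a toy two-state semi-Markov chain. It relies on the standard renewal-reward ratio (expected reward per transition divided by expected sojourn time per transition, both under the stationary distribution of the embedded chain). The finite state space, the transition matrix, the rewards, and the sojourn times are all hypothetical values chosen for illustration; the paper itself treats a denumerable multidimensional state space, and this is not its method.

```python
def stationary_distribution(P, iterations=1000):
    """Approximate the stationary distribution of an ergodic chain by power iteration."""
    n = len(P)
    dist = [1.0 / n] * n
    for _ in range(iterations):
        dist = [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]
    return dist


def average_reward(P, reward, sojourn):
    """Long-run average reward per unit time under a fixed stationary policy.

    P       -- embedded-chain transition matrix induced by the policy
    reward  -- expected reward earned in one transition from each state
    sojourn -- expected duration of one transition from each state
    """
    pi = stationary_distribution(P)
    reward_per_step = sum(p * r for p, r in zip(pi, reward))
    time_per_step = sum(p * t for p, t in zip(pi, sojourn))
    return reward_per_step / time_per_step


# Hypothetical toy data (not taken from the paper).
P = [[0.5, 0.5], [0.25, 0.75]]
reward = [1.0, 4.0]
sojourn = [2.0, 1.0]

gain = average_reward(P, reward, sojourn)
print(gain)
```

Comparing this quantity across the finitely many actions available in each state is what an optimal stationary policy would maximize; the toy example only evaluates a single policy and does not search over actions.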
