Nonstationary continuous time Markov decision processes with the expected total rewards criterion

Qiying Hu

doi:10.1080/02331939608844175

Journal: Optimization	Publication Date: Jan 1, 1996
Citations: 3

Affiliation: Xidian University

#Continuous Time Markov Decision Processes #Nonstationary Markov Decision Processes

Abstract

This paper investigates nonstationary continuous time Markov decision processes under the expected total rewards criterion. Both the state space S and the action sets A(i) are countable, and both the transition rates q_ij(t,a) and the reward rate functions r_i(t,a) are nonhomogeneous and unbounded. For this model, the optimality equation is established and the existence of ε-optimal policies is proved. Finally, the periodic case under the discounted criterion is discussed as a special case of the nonstationary one.