New discount and average optimality conditions for continuous-time Markov decision processes

Xianping Guo,Liuer Ye

doi:10.1239/aap/1293113146

Abstract

This paper deals with continuous-time Markov decision processes in Polish spaces, under the discounted and average cost criteria. All underlying Markov processes are determined by given transition rates which are allowed to be unbounded, and the costs are assumed to be bounded below. By introducing an occupation measure of a randomized Markov policy and analyzing properties of occupation measures, we first show that the family of all randomized stationary policies is ‘sufficient’ within the class of all randomized Markov policies. Then, under the semicontinuity and compactness conditions, we prove the existence of a discounted cost optimal stationary policy by providing a value iteration technique. Moreover, by developing a new average cost, minimum nonnegative solution method, we prove the existence of an average cost optimal stationary policy under some reasonably mild conditions. Finally, we use some examples to illustrate applications of our results. Except that the costs are assumed to be bounded below, the conditions for the existence of discounted cost (or average cost) optimal policies are much weaker than those in the previous literature, and the minimum nonnegative solution approach is new.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

New discount and average optimality conditions for continuous-time Markov decision processes

Abstract

Talk to us

Similar Papers

More From: Advances in Applied Probability

Lead the way for us

Journal: Advances in Applied Probability	Publication Date: Dec 1, 2010
Citations: 12

Similar Papers

New discount and average optimality conditions for continuous-time Markov decision processes
Xianping Guo ... Liuer Ye
Advances in Applied Probability | VOL. 42
Xianping Guo, et. al.Xianping Guo ... Liuer Ye
01 Dec 2010
Advances in Applied Probability | VOL. 42

Semi-Markov and Jump Markov Controlled Models: Average Cost Criterion
M Yu Kitayev
Theory of Probability & Its Applications | VOL. 30
M Yu KitayevM Yu Kitayev
01 Jun 1986
Theory of Probability & Its Applications | VOL. 30

Denumerable state continuous time Markov decision processes with unbounded cost and transition rates under average criterion
Xianping Guo ... Weiping Zhu
The ANZIAM Journal | VOL. 43
Xianping Guo, et. al.Xianping Guo ... Weiping Zhu
01 Apr 2002
The ANZIAM Journal | VOL. 43

Continuous-Time Markov Decision Processes with State-Dependent Discount Factors
Liuer Ye ... Xianping Guo
Acta Applicandae Mathematicae | VOL. 121
Liuer Ye, et. al.Liuer Ye ... Xianping Guo
24 Feb 2012
Acta Applicandae Mathematicae | VOL. 121

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

New discount and average optimality conditions for continuous-time Markov decision processes

Abstract

Talk to us

Similar Papers

More From: Advances in Applied Probability