Continuous-Time Markov Decision Processes with State-Dependent Discount Factors

Liuer Ye, Xianping Guo

doi:10.1007/s10440-012-9669-3

Abstract

We consider continuous-time Markov decision processes in Polish spaces. The performance of a control policy is measured by the expected discounted reward criterion associated with state-dependent discount factors. All underlying Markov processes are determined by the given transition rates, which are allowed to be unbounded, and the reward rates may have neither upper nor lower bounds. Using the dynamic programming approach, we establish the discounted reward optimality equation (DROE) and the existence and uniqueness of its solutions. Under suitable conditions, we also obtain a discounted optimal stationary policy which is optimal in the class of all randomized stationary policies. Moreover, when the transition rates are uniformly bounded, we provide an algorithm to compute (or at least to approximate) the discounted reward optimal value function as well as a discounted optimal stationary policy. Finally, we use an example to illustrate our results. In particular, we derive an explicit and exact solution to the DROE and an explicit expression for a discounted optimal stationary policy in this example.
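
The abstract does not state the algorithm for the uniformly bounded case, so the following is only an illustrative sketch, not the authors' method. It assumes a finite state and action space, discount factors alpha(x) bounded below by a positive constant, and a DROE of the commonly used form alpha(x) V(x) = max_a { r(x, a) + sum_y q(y | x, a) V(y) }. Under those assumptions, uniformization with a rate bound lam turns the equation into a contraction that value iteration can solve. The function name solve_droe, the array layout, and the toy numbers are all hypothetical.

```python
import numpy as np


def solve_droe(q, r, alpha, tol=1e-10, max_iter=100_000):
    """Value iteration for an ASSUMED finite-state form of the DROE.

    q     : array (S, A, S), transition rates q[x, a, y]; each q[x, a, :] sums to 0
    r     : array (S, A), reward rates
    alpha : array (S,), state-dependent discount factors, all strictly positive
    """
    n_states = q.shape[0]
    # Uniform bound on the exit rates -q[x, a, x] (assumed finite and positive).
    lam = float(np.max(-np.diagonal(q, axis1=0, axis2=2)))
    # Uniformized transition probabilities: P = I + Q / lam.
    p = q / lam + np.eye(n_states)[:, None, :]
    # State-dependent scaling 1 / (alpha(x) + lam).
    scale = 1.0 / (alpha + lam)

    v = np.zeros(n_states)
    for _ in range(max_iter):
        # (r(x, a) + lam * sum_y P(y | x, a) v(y)) / (alpha(x) + lam)
        action_values = (r + lam * (p @ v)) * scale[:, None]
        v_new = action_values.max(axis=1)
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break
    policy = action_values.argmax(axis=1)
    return v, policy


# Hypothetical two-state, two-action model (numbers chosen only for illustration).
q = np.array([
    [[-1.0, 1.0], [-2.0, 2.0]],
    [[3.0, -3.0], [0.5, -0.5]],
])
r = np.array([[1.0, 0.5], [0.0, 2.0]])
alpha = np.array([0.1, 0.3])

v, policy = solve_droe(q, r, alpha)
# Residual of the assumed DROE; it should be close to zero at the fixed point.
residual = alpha * v - (r + q @ v).max(axis=1)
```

The iteration contracts with factor at most lam / (min alpha + lam), which is why a strictly positive lower bound on the discount factors is assumed here. The general Polish-space setting with unbounded rates treated in the paper is not covered by this sketch.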


Journal: Acta Applicandae Mathematicae
Publication Date: Feb 24, 2012
Citations: 26

Similar Papers

Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous‐Time Markov Decision Processes
Xiao Wu ... Yanqiu Tang
Discrete Dynamics in Nature and Society | VOL. 2022
01 Jan 2021

Semi-Markov and Jump Markov Controlled Models: Average Cost Criterion
M Yu Kitayev
Theory of Probability & Its Applications | VOL. 30
01 Jun 1986

New discount and average optimality conditions for continuous-time Markov decision processes
Xianping Guo ... Liuer Ye
Advances in Applied Probability | VOL. 42
01 Dec 2010

