The risk probability criterion for discounted continuous-time Markov decision processes

Haifeng Huo, Xiaolong Zou, Xianping Guo

doi:10.1007/s10626-017-0257-6

Abstract

In this paper, we consider the risk probability minimization problem for infinite-horizon discounted continuous-time Markov decision processes (CTMDPs) with unbounded transition rates. First, we introduce a class of policies that depend on histories augmented with reward levels, construct the corresponding probability spaces, and establish the non-explosion of the state process. Second, under suitable conditions, we use an iteration technique to prove that the value function is a solution to the optimality equation for the probability criterion, and we obtain a value iteration algorithm to compute (or at least approximate) the value function. Furthermore, under an additional condition, we establish the uniqueness of the solution to the optimality equation and the existence of an optimal policy. Finally, we illustrate our results with two examples: the first verifies our conditions for CTMDPs with unbounded transition rates, and the second demonstrates the numerical calculation of the value function and an optimal policy.
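To make the criterion concrete, the following is a minimal sketch of what a risk probability measures: the chance that the total discounted reward does not exceed a given reward level. It is not the value iteration algorithm of the paper. It only estimates the risk probability by Monte Carlo simulation for one fixed stationary policy on a small, entirely hypothetical two-state model. The model data (RATES, REWARDS, JUMP_PROBS, POLICY), the discount factor ALPHA, and the truncation of the infinite horizon at HORIZON are all assumptions made for illustration.

```python
import math
import random

# Hypothetical two-state, two-action model (not taken from the paper).
RATES = [[1.0, 2.0], [3.0, 0.5]]        # RATES[state][action]: exit rate
REWARDS = [[1.0, 0.5], [0.0, 2.0]]      # REWARDS[state][action]: reward rate
JUMP_PROBS = [                          # JUMP_PROBS[state][action][next_state]
    [[0.0, 1.0], [0.0, 1.0]],
    [[1.0, 0.0], [1.0, 0.0]],
]
POLICY = [0, 1]                         # fixed stationary policy: state -> action
ALPHA = 0.5                             # discount factor (assumed)
HORIZON = 40.0                          # truncation time approximating infinity


def simulate_discounted_reward(start, rng):
    """Simulate one path and return its discounted reward up to HORIZON."""
    state, t, total = start, 0.0, 0.0
    while t < HORIZON:
        action = POLICY[state]
        sojourn = rng.expovariate(RATES[state][action])
        end = min(t + sojourn, HORIZON)
        # Exact integral of exp(-ALPHA * s) * reward over [t, end].
        total += REWARDS[state][action] * (
            math.exp(-ALPHA * t) - math.exp(-ALPHA * end)
        ) / ALPHA
        t = end
        if t < HORIZON:
            probs = JUMP_PROBS[state][action]
            state = rng.choices(range(len(probs)), weights=probs)[0]
    return total


def risk_probability(start, level, n_paths=2000, seed=0):
    """Estimate P(discounted reward <= level) under POLICY by simulation."""
    rng = random.Random(seed)
    hits = sum(
        simulate_discounted_reward(start, rng) <= level for _ in range(n_paths)
    )
    return hits / n_paths


if __name__ == "__main__":
    for level in (1.0, 3.0, 4.0):
        print(level, risk_probability(start=0, level=level))
```

Under the chosen policy the reward rate in this toy model always lies between 1 and 2, so the discounted reward lies between roughly 1/ALPHA and 2/ALPHA; reward levels outside that range give estimates of exactly 0 or 1. The optimization problem in the abstract asks for the policy that minimizes this probability, which the sketch does not attempt.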

Journal: Discrete Event Dynamic Systems
Publication Date: Aug 10, 2017
Citations: 15

Similar Papers

Semi-Markov and Jump Markov Controlled Models: Average Cost Criterion
M Yu Kitayev
Theory of Probability & Its Applications | VOL. 30
01 Jun 1986

Risk Probability Minimization Problems for Continuous-Time Markov Decision Processes on Finite Horizon
Haifeng Huo ... Xianping Guo
IEEE Transactions on Automatic Control | VOL. 65
25 Oct 2019

Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V Denardo
SIAM Review | VOL. 9
01 Apr 1967

Finite-horizon optimality for continuous-time Markov decision processes with unbounded transition rates
Xianping Guo ... Xiangxiang Huang
Advances in Applied Probability | VOL. 47
01 Dec 2015
