Average optimality inequality for continuous-time Markov decision processes in Polish spaces

Quanxin Zhu

doi:10.1007/s00186-007-0157-x

Abstract

In this paper, we study average optimality for continuous-time controlled jump Markov processes in general state and action spaces. The criterion to be minimized is the expected average cost. Both the transition rates and the cost rates are allowed to be unbounded. We propose another set of conditions under which we first establish an average optimality inequality by using the well-known “vanishing discounting factor approach”. Then, when the cost (or reward) rates are nonnegative (or nonpositive), we use the average optimality inequality, together with the Dynkin formula and the Tauberian theorem, to prove the existence of an average optimal stationary policy within the class of all randomized history-dependent policies. Finally, when the cost (or reward) rates have neither upper nor lower bounds, we also prove the existence of an average optimal policy within the class of all (deterministic) stationary policies by constructing a “new” cost (or reward) rate.
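
The vanishing discount idea mentioned in the abstract can be illustrated on a toy model. The sketch below is not the paper's construction: it assumes a hypothetical two-state, two-action model with bounded rates (the paper treats Polish spaces and unbounded rates), and every number and function name in it is made up for illustration. For each deterministic stationary policy, the alpha-discounted cost V solves (alpha I - Q) V = c, which has a closed form for two states. Multiplying the optimal discounted cost by alpha and letting alpha shrink recovers the optimal long-run average cost in this toy case.

```python
from itertools import product

# Hypothetical two-state, two-action model; all numbers are illustrative.
# RATE[i][u]: rate of jumping from state i to the other state under action u.
# COST[i][u]: cost rate incurred in state i under action u.
RATE = {0: {0: 1.0, 1: 3.0}, 1: {0: 2.0, 1: 0.5}}
COST = {0: {0: 1.0, 1: 2.5}, 1: {0: 4.0, 1: 3.0}}

# A deterministic stationary policy is a pair (action in state 0, action in state 1).
POLICIES = list(product((0, 1), repeat=2))


def _params(policy):
    """Return (a, b, c0, c1): jump rates 0->1 and 1->0, and the two cost rates."""
    u0, u1 = policy
    return RATE[0][u0], RATE[1][u1], COST[0][u0], COST[1][u1]


def discounted_values(policy, alpha):
    """Solve (alpha*I - Q) V = c in closed form for the two-state chain."""
    a, b, c0, c1 = _params(policy)
    det = alpha * (alpha + a + b)
    v0 = (c0 * (alpha + b) + a * c1) / det
    v1 = (c1 * (alpha + a) + b * c0) / det
    return v0, v1


def average_cost(policy):
    """Long-run average cost: stationary distribution (b, a)/(a+b) dotted with costs."""
    a, b, c0, c1 = _params(policy)
    return (b * c0 + a * c1) / (a + b)


def optimal_discounted(alpha):
    """Optimal discounted cost per state, by enumerating the four policies."""
    values = [discounted_values(f, alpha) for f in POLICIES]
    return tuple(min(v[i] for v in values) for i in (0, 1))


def optimal_average():
    """Optimal average cost over deterministic stationary policies."""
    return min(average_cost(f) for f in POLICIES)


if __name__ == "__main__":
    g_star = optimal_average()
    for alpha in (1.0, 0.1, 0.01, 0.001):
        v0, v1 = optimal_discounted(alpha)
        # alpha * V approaches g_star, while the difference v1 - v0 stays bounded.
        print(alpha, alpha * v0, alpha * v1, v1 - v0, g_star)
```

In this finite example the limit holds with equality. The abstract's weaker statement, an inequality, is what survives under its more general conditions.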

Journal: Mathematical Methods of Operations Research
Publication Date: Jul 19, 2007
Citations: 24

Similar Papers

Strong average optimality criterion for continuous-time Markov decision processes
Qingda Wei ... Xian Chen
Kybernetika | VOL. 50 | 02 Jan 2015

Average optimality for continuous-time Markov decision processes in Polish spaces
Xianping Guo ... Ulrich Rieder
The Annals of Applied Probability | VOL. 16 | 01 May 2006

Policy Iteration for Continuous-Time Average Reward Markov Decision Processes in Polish Spaces
Quanxin Zhu ... Chuangxia Huang
Abstract and Applied Analysis | VOL. 2009 | 01 Jan 2009

Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
Quan-Xin Zhu
Acta Mathematicae Applicatae Sinica, English Series | VOL. 27 | 09 Sep 2011
