Finite horizon continuous-time Markov decision processes with mean and variance criteria

Yonghui Huang

doi:10.1007/s10626-018-0273-1

Abstract

This paper studies mean maximization and variance minimization problems in finite horizon continuous-time Markov decision processes. The state and action spaces are assumed to be Borel spaces, while reward functions and transition rates are allowed to be unbounded. For the mean problem, we design a method called successive approximation, which enables us to prove the existence of a solution to the Hamilton-Jacobi-Bellman (HJB) equation, and then the existence of a mean-optimal policy under some growth and compact-continuity conditions. For the variance problem, using the first-jump analysis, we succeed in converting the second moment of the finite horizon reward to a mean of a finite horizon reward with new reward functions under suitable conditions, based on which the associated HJB equation for the variance problem and the existence of variance-optimal policies are established. Value iteration algorithms for computing mean- and variance-optimal policies are proposed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finite horizon continuous-time Markov decision processes with mean and variance criteria

Abstract

Talk to us

Similar Papers

More From: Discrete Event Dynamic Systems

Lead the way for us

Journal: Discrete Event Dynamic Systems	Publication Date: Sep 29, 2018
Citations: 5

Similar Papers

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes
Yonghui Huang ... Xianping Guo
Applied Mathematics & Optimization | VOL. 72
Yonghui Huang, et. al.Yonghui Huang ... Xianping Guo
27 Nov 2014
Applied Mathematics & Optimization | VOL. 72

Learning Algorithms for Price Control in an Internet-Based Dutch Auction
K Ravikumar ... Diatha Krishna Sundar
SSRN Electronic Journal | VOL. -
K Ravikumar, et. al.K Ravikumar ... Diatha Krishna Sundar
20 Oct 2001
SSRN Electronic Journal | VOL. -

Semi-Markov and Jump Markov Controlled Models: Average Cost Criterion
M Yu Kitayev
Theory of Probability & Its Applications | VOL. 30
M Yu KitayevM Yu Kitayev
01 Jun 1986
Theory of Probability & Its Applications | VOL. 30

Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
Naoyuki Ichihara
Applied Mathematics & Optimization | VOL. 84
Naoyuki IchiharaNaoyuki Ichihara
04 Aug 2020
Applied Mathematics & Optimization | VOL. 84

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finite horizon continuous-time Markov decision processes with mean and variance criteria

Abstract

Talk to us

Similar Papers

More From: Discrete Event Dynamic Systems