Abstract

The computation of ϵ-optimal policies for continuous-time Markov decision processes (CTMDPs) over finite time intervals is a challenging problem because the optimal policy may change at arbitrary times. Numerical algorithms based on time discretization or uniformization have been proposed for computing optimal policies. The uniformization-based algorithm has been shown to be more reliable and often also more efficient, but it is currently only available for processes where the gain or reward does not depend on the decision taken in a state. In this paper, we present two new uniformization-based algorithms for computing ϵ-optimal policies for CTMDPs with decision-dependent rewards over a finite time horizon. Thanks to a new and tighter upper bound, the proposed algorithms can not only handle decision-dependent rewards, they also outperform the existing approach for rewards that do not depend on the decision. In particular, for models where the policy changes only rarely, optimal policies can be computed much faster.
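For orientation, below is a minimal Python sketch of the kind of backward value recursion that discretization- and uniformization-based methods refine. The names `Q[a]` (generator matrix per action), `r[a]` (reward-rate vector per action), and `g` (terminal reward vector) are hypothetical placeholders, not notation from the paper. The sketch picks a uniformization rate from the fastest exit rate, chooses the step size so that I + h·Q_a is a stochastic matrix, and steps the Bellman equation backwards over the horizon. It is only a first-order approximation; the paper's algorithms additionally compute error bounds to certify ϵ-optimality and to detect where the policy changes, which this sketch does not do.

```python
import numpy as np

def finite_horizon_ctmdp(Q, r, g, T):
    """First-order backward recursion for a finite-horizon CTMDP (illustrative sketch).

    Q : dict mapping action -> (n x n) generator matrix (rows sum to 0)
    r : dict mapping action -> length-n reward-rate vector
    g : length-n terminal reward vector
    T : length of the time horizon

    Returns the approximate value vector at time 0 and the greedy action
    chosen per state at each time step.
    """
    actions = list(Q)
    n = len(g)
    # Uniformization rate: an upper bound on the exit rate over all states and actions.
    lam = max(np.max(-np.diag(Q[a])) for a in actions)
    h = 1.0 / lam                      # step size so that I + h*Q_a is stochastic
    steps = int(np.ceil(T / h))
    h = T / steps                      # shrink h so the steps cover [0, T] exactly

    V = np.array(g, dtype=float)       # value at the end of the horizon
    policy = np.zeros((steps, n), dtype=int)

    for k in range(steps - 1, -1, -1):
        # For each action: one step of accumulated reward plus expected future value.
        cand = np.stack([h * r[a] + (np.eye(n) + h * Q[a]) @ V for a in actions])
        policy[k] = np.argmax(cand, axis=0)   # greedy action per state at this step
        V = np.max(cand, axis=0)
    return V, policy

# Small illustrative instance with two states and two actions (made-up numbers).
Q = {0: np.array([[-1.0, 1.0], [2.0, -2.0]]),
     1: np.array([[-3.0, 3.0], [1.0, -1.0]])}
r = {0: np.array([1.0, 0.0]), 1: np.array([2.0, 0.5])}
g = np.array([0.0, 0.0])
V0, pol = finite_horizon_ctmdp(Q, r, g, T=1.0)
```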
