Discounted continuous-time Markov decision processes with unbounded rates and randomized history-dependent policies: the dynamic programming approach

Alexey Piunovskiy, Yi Zhang

doi:10.1007/s10288-013-0236-1

Journal: 4OR
Publication Date: Mar 31, 2013
Citations: 24

Affiliation: University of Liverpool

#Continuous-time Markov Decision Process #History-dependent Policies

Abstract

This paper deals with a continuous-time Markov decision process with Borel state and action spaces and unbounded transition rates. Under history-dependent policies, the controlled process may not be Markov. The main contribution is to establish the Dynkin formula for such non-Markov processes; this formula plays an important role in establishing optimality results for continuous-time Markov decision processes. We illustrate this by showing, for a discounted continuous-time Markov decision process, that there exists a deterministic stationary policy that is optimal within the class of history-dependent policies, and by characterizing the value function through the Bellman equation.
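
For readers who want a concrete picture of what "characterizing the value function through the Bellman equation" can look like, the following is a minimal sketch for a toy model only. It assumes a finite state space, finitely many actions, bounded transition rates, and cost rates to be minimized, none of which reflects the paper's general setting of Borel spaces and unbounded rates. In that toy setting the discounted Bellman equation is commonly written as alpha * V(x) = min over a of { c(x, a) + sum over y of q(y | x, a) * V(y) }, and the sketch solves it by uniformization followed by value iteration, a standard textbook technique and not the method of the paper. The function name, the argument layout, and the example data are hypothetical.

```python
def solve_discounted_ctmdp(q, c, alpha, tol=1e-10, max_iter=100_000):
    """Value iteration for a finite discounted CTMDP (illustrative sketch).

    q[x][a][y] : transition rate from state x to state y under action a
                 (each row q[x][a] is assumed to sum to zero)
    c[x][a]    : cost rate in state x under action a (assumed convention)
    alpha      : discount rate, alpha > 0
    """
    n = len(q)
    # Uniformization constant: an upper bound on all exit rates -q[x][a][x].
    lam = max(-q[x][a][x] for x in range(n) for a in range(len(q[x])))
    lam = max(lam, 1e-12)  # guard against a model with no transitions at all

    def q_value(x, a, v):
        # Uniformized form of c(x,a) + sum_y q(y|x,a) v(y) = alpha * v(x):
        # v(x) = [c(x,a) + sum_y q(y|x,a) v(y) + lam * v(x)] / (alpha + lam)
        drift = sum(q[x][a][y] * v[y] for y in range(n))
        return (c[x][a] + drift + lam * v[x]) / (alpha + lam)

    v = [0.0] * n
    for _ in range(max_iter):
        new_v = [min(q_value(x, a, v) for a in range(len(q[x])))
                 for x in range(n)]
        diff = max(abs(new_v[x] - v[x]) for x in range(n))
        v = new_v
        if diff < tol:
            break

    # A deterministic stationary policy: one minimizing action per state.
    policy = [min(range(len(q[x])), key=lambda a: q_value(x, a, v))
              for x in range(n)]
    return v, policy


# Hypothetical two-state example: state 1 is absorbing and cost-free.
# In state 0, action 0 leaves slowly at low cost, action 1 leaves fast at
# a higher cost rate.
q = [
    [[-1.0, 1.0], [-3.0, 3.0]],   # state 0: action 0, action 1
    [[0.0, 0.0], [0.0, 0.0]],     # state 1: absorbing under both actions
]
c = [
    [1.0, 1.5],                   # cost rates in state 0
    [0.0, 0.0],                   # cost rates in state 1
]
values, policy = solve_discounted_ctmdp(q, c, alpha=1.0)
```

In this toy example the two candidate values in state 0 are 1 / (1 + 1) = 0.5 for the slow action and 1.5 / (1 + 3) = 0.375 for the fast one, so the sketch should select the fast action there. The returned policy depends only on the current state, which is the kind of deterministic stationary policy the abstract refers to, although nothing in this finite computation substitutes for the paper's argument in the general case.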

Full Text