On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces

Naci Saldi, Serdar Yüksel, Tamás Linder

doi:10.1287/moor.2016.0832

Abstract

Calculating optimal policies is known to be computationally difficult for Markov decision processes (MDPs) with Borel state and action spaces. This paper studies finite-state approximations of discrete-time Markov decision processes with Borel state and action spaces, for both discounted and average cost criteria. The stationary policies thus obtained are shown to approximate the optimal stationary policy with arbitrary precision under quite general conditions for discounted cost and more restrictive conditions for average cost. For compact-state MDPs, we obtain explicit rate-of-convergence bounds quantifying how the approximation improves as the size of the approximating finite state space increases. Using information-theoretic arguments, the order optimality of the obtained convergence rates is established for a large class of problems. We also show that, as a pre-processing step, the action space can also be finitely approximated with a sufficiently large number of points; thereby, well-known algorithms, such as value or policy iteration, Q-learning, etc., can be used to calculate near-optimal policies.
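
As a rough illustration of the pipeline the abstract describes (quantize the state and action spaces, then run a standard algorithm on the resulting finite model), the sketch below discretizes a toy discounted-cost MDP on [0, 1] and applies value iteration. The dynamics, cost function, noise distribution, grid sizes, discount factor, and the names `build_finite_model` and `value_iteration` are all hypothetical choices made for illustration and are not taken from the paper. Nearest-bin quantization is used here as one simple scheme and is not necessarily the paper's construction.

```python
import numpy as np


def build_finite_model(n_states, n_actions):
    """Quantize a toy MDP on [0, 1] (hypothetical dynamics and cost)."""
    # Representative points: midpoints of n_states equal-width bins of [0, 1].
    states = (np.arange(n_states) + 0.5) / n_states
    # Finite action grid on [0, 1].
    actions = np.linspace(0.0, 1.0, n_actions)
    # Assumed noise with finite support, so the kernel is computed exactly.
    noise = np.array([-0.1, 0.0, 0.1])
    probs = np.array([0.25, 0.5, 0.25])
    # Assumed one-stage cost c(x, a) = (x - 0.5)^2 + 0.1 * a^2.
    cost = (states[:, None] - 0.5) ** 2 + 0.1 * actions[None, :] ** 2
    # P[i, j, k] = probability of landing in bin k from state i under action j.
    P = np.zeros((n_states, n_actions, n_states))
    for w, p in zip(noise, probs):
        # Assumed dynamics: x' = clip(0.5 * x + 0.5 * a + w, 0, 1).
        nxt = np.clip(0.5 * states[:, None] + 0.5 * actions[None, :] + w, 0.0, 1.0)
        # Map each next state to the index of the bin that contains it.
        idx = np.minimum((nxt * n_states).astype(int), n_states - 1)
        for i in range(n_states):
            for j in range(n_actions):
                P[i, j, idx[i, j]] += p
    return states, actions, cost, P


def value_iteration(cost, P, beta=0.9, tol=1e-8, max_iter=10_000):
    """Standard discounted-cost value iteration on the finite model."""
    V = np.zeros(cost.shape[0])
    for _ in range(max_iter):
        # Q[i, j] = c(i, j) + beta * sum_k P[i, j, k] * V[k]
        Q = cost + beta * (P @ V)
        V_new = Q.min(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Greedy (stationary) policy with respect to the final value estimate.
    policy = (cost + beta * (P @ V)).argmin(axis=1)
    return V, policy
```

Calling `build_finite_model(20, 5)` followed by `value_iteration(cost, P)` returns a value estimate and an action index for each of the 20 bins. Extending the resulting policy to the original space by assigning every state the action of its bin is the kind of piecewise-constant stationary policy the abstract refers to, under the assumptions above.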

Journal: Mathematics of Operations Research
Publication Date: Nov 1, 2017
Citations: 39

Similar Papers

Finite-state approximation of Markov decision processes with unbounded costs and Borel spaces
Naci Saldi ... Serdar Yuksel
01 Dec 2015

Finite state approximations of Markov decision processes with general state and action spaces
Naci Saldi ... Tamas Linder
01 Jul 2015

Discrete type shock semi-Markov decision processes with Borel state space
Qiying Hu
Optimization | VOL. 28
01 Jan 1993

Average optimality for Markov decision processes in Borel spaces: a new condition and approach
Xianping Guo ... Quanxin Zhu
Journal of Applied Probability | VOL. 43
01 Jun 2006
