Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities

François Dufour,Tomás Prieto-Rumeau

doi:10.1080/17442508.2014.939979

Abstract

We consider a discrete-time Markov decision process with Borel state and action spaces, and possibly unbounded cost function. We assume that the Markov transition kernel is absolutely continuous with respect to some probability measure . By replacing this probability measure with its empirical distribution for a sample of size n, we obtain a finite state space control problem, which is used to provide an approximation of the optimal value and an optimal policy of the original control model. We impose Lipschitz continuity properties on the control model and its associated density functions. We measure the accuracy of the approximation of the optimal value and an optimal policy by means of a non-asymptotic concentration inequality based on the 1-Wasserstein distance between and . Obtaining numerically the solution of the approximating control model is discussed and an application to an inventory management problem is presented.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities

Abstract

Talk to us

Similar Papers

More From: Stochastics An International Journal of Probability and Stochastic Processes

Lead the way for us

Journal: Stochastics An International Journal of Probability and Stochastic Processes	Publication Date: Nov 7, 2014
Citations: 27

Similar Papers

Average cost Markov Decision Processes: Optimality conditions
O Hernández-Lerma
Journal of Mathematical Analysis and Applications | VOL. 158
O Hernández-LermaO Hernández-Lerma
01 Jul 1991
Journal of Mathematical Analysis and Applications | VOL. 158

On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces
Naci Saldi ... Tamás Linder
Mathematics of Operations Research | VOL. 42
Naci Saldi, et. al.Naci Saldi ... Tamás Linder
01 Nov 2017
Mathematics of Operations Research | VOL. 42

Finite-state approximation of Markov decision processes with unbounded costs and Borel spaces
Naci Saldi ... Serdar Yuksel
-
Naci Saldi, et. al.Naci Saldi ... Serdar Yuksel
01 Dec 2015
01 Dec 2015

Discrete type shock semi-markov decision processes with borel state space
Qiying Hu
Optimization | VOL. 28
Qiying HuQiying Hu
01 Jan 1993
Optimization | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities

Abstract

Talk to us

Similar Papers

More From: Stochastics An International Journal of Probability and Stochastic Processes