Stochastic approximations of constrained discounted Markov decision processes

François Dufour, Tomás Prieto-Rumeau

doi:10.1016/j.jmaa.2013.12.016

Abstract

We consider a discrete-time constrained Markov decision process under the discounted cost optimality criterion. The state and action spaces are assumed to be Borel spaces, while the cost and constraint functions might be unbounded. We are interested in numerically approximating the optimal discounted constrained cost. To this end, we suppose that the transition kernel of the Markov decision process is absolutely continuous with respect to some probability measure μ. Then, by solving the linear programming formulation of a constrained control problem related to the empirical probability measure μn of μ, we obtain the corresponding approximation of the optimal constrained cost. We derive a concentration inequality which gives a bound on the probability that the estimation error is larger than some given constant. This bound is shown to decrease exponentially in n. Our theoretical results are illustrated with a numerical application based on a stochastic version of the Beverton–Holt population model.
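
The core idea of replacing μ by its empirical measure μn can be illustrated with a minimal sketch. This is not the authors' algorithm: it only shows how an integral against a kernel with a density with respect to μ becomes a sample average once μ is replaced by n independent draws. The choice of μ as the uniform distribution on [0, 1], the density 2y, the function v(y) = y², and the names `empirical_expectation` and `demo` are all assumptions made for illustration.

```python
import random


def empirical_expectation(v, density, samples):
    """Approximate the integral of v(y) * density(y) mu(dy) by a sample average.

    `samples` are independent draws from mu, so the average below is the
    integral of v * density against the empirical measure mu_n.
    """
    return sum(v(y) * density(y) for y in samples) / len(samples)


def demo(n, seed=0):
    rng = random.Random(seed)
    # Assumption: mu is the uniform distribution on [0, 1].
    samples = [rng.random() for _ in range(n)]
    # Hypothetical density of the transition kernel with respect to mu.
    density = lambda y: 2.0 * y
    # Hypothetical function to integrate (for example, a value function).
    v = lambda y: y * y
    return empirical_expectation(v, density, samples)


if __name__ == "__main__":
    # The exact integral of y^2 * 2y over [0, 1] is 0.5; the estimates
    # should settle near that value as n grows.
    for n in (100, 1000, 10000):
        print(n, demo(n))
```

In the paper's setting the same substitution is applied inside the linear programming formulation of the constrained control problem, and the concentration inequality quantifies how fast the resulting error probability decays in n.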

Journal: Journal of Mathematical Analysis and Applications
Publication Date: Dec 16, 2013
Citations: 18
License type: publisher-specific-oa

Similar Papers

Weakly Coupled Constrained Markov Decision Processes in Borel Spaces
Mukul Gagrani ... Ashutosh Nayyar
01 Jul 2020

On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies
Huizhen Yu
Journal of Mathematical Analysis and Applications | VOL. 509
28 Dec 2021

On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces
Naci Saldi ... Tamás Linder
Mathematics of Operations Research | VOL. 42
01 Nov 2017

Computable approximations for average Markov decision processes in continuous time
Jonatha Anselmi ... François Dufour
Journal of Applied Probability | VOL. 55
01 Jun 2018
