Finite Linear Programming Approximations of Constrained Discounted Markov Decision Processes

François Dufour,Tomás Prieto-Rumeau

doi:10.1137/120867925

Abstract

We consider a Markov decision process (MDP) with constraints under the total expected discounted cost optimality criterion. We are interested in proposing approximation methods of the optimal value of this constrained MDP. To this end, starting from the linear programming (LP) formulation of the constrained MDP (on an infinite-dimensional space of measures), we propose a finite state approximation of this LP problem. This is achieved by suitably approximating a probability measure underlying the random transitions of the dynamics of the system. Explicit convergence orders of the approximations of the optimal constrained cost are obtained. By exploiting convexity properties of the class of relaxed controls, we reduce the LP formulation of the constrained MDP to a finite-dimensional static optimization problem that can be used to obtain explicit numerical approximations of the corresponding optimal constrained cost. A numerical application illustrates our theoretical results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finite Linear Programming Approximations of Constrained Discounted Markov Decision Processes

Abstract

Talk to us

Similar Papers

More From: SIAM Journal on Control and Optimization

Lead the way for us

Journal: SIAM Journal on Control and Optimization	Publication Date: Jan 1, 2013
Citations: 45

Similar Papers

Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V Denardo
SIAM Review | VOL. 9
Eric V DenardoEric V Denardo
01 Apr 1967
SIAM Review | VOL. 9

A new LP formulation of the admission control problem modelled as an MDP under average reward criterion
Antonio Pietrabissa
International Journal of Systems Science | VOL. 42
Antonio PietrabissaAntonio Pietrabissa
01 Dec 2011
International Journal of Systems Science | VOL. 42

Linear programming formulation for non-stationary, finite-horizon Markov decision process models
Arnab Bhattacharya ... Jeffrey P Kharoufeh
Operations Research Letters | VOL. 45
Arnab Bhattacharya, et. al.Arnab Bhattacharya ... Jeffrey P Kharoufeh
17 Sep 2017
Operations Research Letters | VOL. 45

Parameter dependentH∞ control by finite dimensional LMI optimization: application to trade-off dependent control
M Dinh ... E Magarotto
International Journal of Robust and Nonlinear Control | VOL. 15
M Dinh, et. al.M Dinh ... E Magarotto
01 Jan 2004
International Journal of Robust and Nonlinear Control | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finite Linear Programming Approximations of Constrained Discounted Markov Decision Processes

Abstract

Talk to us

Similar Papers

More From: SIAM Journal on Control and Optimization