Abstract

We are concerned in this paper with discrete-time Markov Decision Processes (MDPs) with Borel state and action spaces X and A, respectively, and the long-run expected average cost criterion. When X is a denumerable set, many necessary and/or sufficient conditions for the existence of optimal control policies are known. However, when X is a Borel space (i.e., a Borel subset of a complete separable metric space), most of the available results impose very restrictive topological conditions on the MDP (e.g., compactness) and/or strong recurrence assumptions (such as Doeblin's condition); see, e.g., [4, 9, 12] and their references. Another related work is [7], where we studied MDPs from the viewpoint of the recurrence (or ergodicity) properties of the state process. In the present paper, however, we are concerned with the existence of average optimal policies, obtained by looking at (static) optimization problems (see condition C5 in Section 4) related, and in some cases equivalent, to the existence of a bounded solution to the so-called Optimality Equation (see C4 in Section 4). These optimization problems are dual in the sense that, under appropriate conditions, the existence of an optimal solution to one of them implies the existence of an optimal solution to the other.
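For orientation, here is a minimal sketch of the two objects named above, written in generic average-cost MDP notation; the cost function c, the transition kernel Q, the admissible-action sets A(x), the constant rho, and the function h are standard symbols assumed for illustration, not necessarily the paper's own notation or conditions. The long-run expected average cost of a policy \pi from initial state x is

\[
J(\pi, x) \;=\; \limsup_{n \to \infty} \frac{1}{n}\,
  \mathbb{E}^{\pi}_{x}\Big[\sum_{t=0}^{n-1} c(x_t, a_t)\Big],
\]

and the Optimality Equation asks for a constant \rho and a bounded measurable function h on X such that

\[
\rho + h(x) \;=\; \min_{a \in A(x)} \Big[ c(x,a) + \int_X h(y)\, Q(dy \mid x, a) \Big]
\qquad \text{for all } x \in X .
\]

Under suitable conditions, \rho is then the optimal average cost and any policy selecting a minimizer for each x is average optimal.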
