Equivalence of Lyapunov stability criteria in a class of Markov decision processes

Rolando Cavazos-Cadena, Onésimo Hernández-Lerma

doi:10.1007/bf01189027

Abstract

We are concerned with Markov decision processes with countable state space and discrete-time parameter. The main structural restriction on the model is the following: under the action of any stationary policy the state space is a communicating class. In this context, we prove the equivalence of ten stability/ergodicity conditions on the transition law of the model, which imply the existence of average optimal stationary policies for an arbitrary continuous and bounded reward function; these conditions include the Lyapunov function condition (LFC) introduced by A. Hordijk. As a consequence of our results, the LFC is proved to be equivalent to the following: under the action of any stationary policy the corresponding Markov chain has a unique invariant distribution which depends continuously on the stationary policy being used. A weak form of the latter condition was used by one of the authors to establish the existence of optimal stationary policies using an approach based on renewal theory.
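
As a rough illustration of the condition "a unique invariant distribution which depends continuously on the stationary policy", the sketch below uses a toy model that is not taken from the paper: two states instead of a countable state space, action set [0, 1] in each state, and made-up transition probabilities. The names `transition_matrix` and `invariant_distribution` are hypothetical helpers. Every stationary policy f = (f(0), f(1)) yields a chain in which both states communicate, and the invariant distribution is approximated by power iteration.

```python
def transition_matrix(f):
    """Transition matrix of the toy chain under the stationary policy f.

    f[x] in [0, 1] is the action chosen in state x. The probabilities
    below are illustrative assumptions, not the paper's model.
    """
    p01 = 0.1 + 0.7 * f[0]   # probability of moving 0 -> 1
    p10 = 0.5 - 0.2 * f[1]   # probability of moving 1 -> 0
    return [[1.0 - p01, p01], [p10, 1.0 - p10]]


def invariant_distribution(P, max_iters=10_000, tol=1e-12):
    """Approximate pi with pi = pi P by power iteration (finite chain)."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(max_iters):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


# Two nearby policies give nearby invariant distributions.
pi_f = invariant_distribution(transition_matrix((0.5, 0.5)))
pi_g = invariant_distribution(transition_matrix((0.5 + 1e-6, 0.5)))
gap = max(abs(a - b) for a, b in zip(pi_f, pi_g))
```

In this finite toy model continuity is automatic; the equivalences described in the abstract concern the countable-state setting, where it is a genuine restriction.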

Journal: Applied Mathematics & Optimization
Publication Date: Sep 1, 1992
Citations: 21

Similar Papers

A Note on the Existence of Optimal Policies in Total Reward Dynamic Programs with Compact Action Sets
Rolando Cavazos-Cadena ... Eugene A Feinberg
Mathematics of Operations Research | VOL. 25
01 Nov 2000

A note on the existence of optimal stationary policies for average Markov decision processes with countable states
Li Xia ... Xi-Ren Cao
Automatica | VOL. 151
16 Feb 2023

Recent results on conditions for the existence of average optimal stationary policies
Rolando Cavazos-Cadena
Annals of Operations Research | VOL. 28
01 Dec 1991

Policy improvement algorithm for continuous time Markov decision processes with switching costs
Bharat Doshi
01 Jan 1979
