Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

Rolando Cavazos-Cadena

doi:10.1287/moor.2017.0893

Abstract

This work is concerned with Markov decision chains on a denumerable state space. The controller has a positive risk-sensitivity coefficient, and the performance of a control policy is measured by a risk-sensitive average cost criterion. Besides standard continuity-compactness conditions, it is assumed that the state process is communicating under any stationary policy, and that the simultaneous Doeblin condition holds. In this context, it is shown that if the cost function is bounded from below, and the superior limit average index is finite at some point, then (i) the optimal superior and inferior limit average value functions coincide and are constant, (ii) the optimal average cost is characterized via an extended version of the Collatz-Wielandt formula in the theory of positive matrices, and (iii) an optimality inequality is established, from which a stationary optimal policy is obtained. Moreover, an explicit example is given to show that, even if the cost function is bounded, the strict inequality may occur in the optimality relation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

Abstract

Talk to us

Similar Papers

More From: Mathematics of Operations Research

Lead the way for us

Journal: Mathematics of Operations Research	Publication Date: Aug 1, 2018
Citations: 18

Similar Papers

Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
Rolando Cavazos-Cadena ... Emmanuel Fernández-Gaucherand
Mathematical Methods of Operations Research | VOL. 49
Rolando Cavazos-Cadena, et. al.Rolando Cavazos-Cadena ... Emmanuel Fernández-Gaucherand
01 Apr 1999
Mathematical Methods of Operations Research | VOL. 49

The Value Iteration Algorithm in Risk-Sensitive Average Markov Decision Chains with Finite State Space
Rolando Cavazos-Cadena ... Raúl Montes-De-Oca
Mathematics of Operations Research | VOL. 28
Rolando Cavazos-Cadena, et. al.Rolando Cavazos-Cadena ... Raúl Montes-De-Oca
01 Nov 2003
Mathematics of Operations Research | VOL. 28

A counterexample on the optimality equation in Markov decision chains with the average cost criterion
Rolando Cavazos-Cadena
Systems & Control Letters | VOL. 16
Rolando Cavazos-CadenaRolando Cavazos-Cadena
01 May 1991
Systems & Control Letters | VOL. 16

Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion
Selene Chávez-Rodríguez ... Rolando Cavazos-Cadena
Journal of Optimization Theory and Applications | VOL. 170
Selene Chávez-Rodríguez, et. al.Selene Chávez-Rodríguez ... Rolando Cavazos-Cadena
11 Mar 2016
Journal of Optimization Theory and Applications | VOL. 170

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

Abstract

Talk to us

Similar Papers

More From: Mathematics of Operations Research