Minimizing spectral risk measures applied to Markov decision processes

Nicole Bäuerle,Alexander Glauner

doi:10.1007/s00186-021-00746-w

Abstract

We study the minimization of a spectral risk measure of the total discounted cost generated by a Markov Decision Process (MDP) over a finite or infinite planning horizon. The MDP is assumed to have Borel state and action spaces and the cost function may be unbounded above. The optimization problem is split into two minimization problems using an infimum representation for spectral risk measures. We show that the inner minimization problem can be solved as an ordinary MDP on an extended state space and give sufficient conditions under which an optimal policy exists. Regarding the infinite dimensional outer minimization problem, we prove the existence of a solution and derive an algorithm for its numerical approximation. Our results include the findings in Bäuerle and Ott (Math Methods Oper Res 74(3):361–379, 2011) in the special case that the risk measure is Expected Shortfall. As an application, we present a dynamic extension of the classical static optimal reinsurance problem, where an insurance company minimizes its cost of capital.

Highlights

There have been various proposals to replace the expectation in the optimization of Markov Decision Processes (MDPs) by risk measures
The recursive approach for general MDP can for example be found in Ruszczynski (2010); Chu and Zhang (2014); Bäuerle and Glauner (2021)
The theory for these kind of models is rather different to the ones where the risk measures is applied to the total cost, since in the recursive approach we still get a recursive solution procedure directly

Summary

Introduction

There have been various proposals to replace the expectation in the optimization of Markov Decision Processes (MDPs) by risk measures. The recursive approach for general MDP can for example be found in Ruszczynski (2010); Chu and Zhang (2014); Bäuerle and Glauner (2021) The theory for these kind of models is rather different to the ones where the risk measures is applied to the total cost, since in the recursive approach we still get a recursive solution procedure directly. The inner problem is to minimize the expected convex function of the total cost It can be solved with MDP techniques after a suitable extension of the original state space. We treat the outer optimization problem and state the existence of an optimal function in the representation of the spectral risk measure. All proofs and detailed derivations of our results are deferred to the appendix

Spectral risk measures

Markov decision model

Inner problem

Solution of the extended MDP

Outer problem: existence and numerical approximation

Infinite planning horizon

Relaxed assumptions for monotone models

Dynamic optimal reinsurance

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematical methods of operations research (Heidelberg, Germany)	Publication Date: Jul 27, 2021
Citations: 10	License type: open-access

R Discovery Prime

R Discovery Prime

Minimizing spectral risk measures applied to Markov decision processes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Mathematical methods of operations research (Heidelberg, Germany)

Lead the way for us

Similar Papers

Markov decision processes with recursive risk measures
Nicole Bäuerle ... Alexander Glauner
European Journal of Operational Research | VOL. 296
Nicole Bäuerle, et. al.Nicole Bäuerle ... Alexander Glauner
24 Apr 2021
European Journal of Operational Research | VOL. 296

Solvency II, regulatory capital, and optimal reinsurance: How good are Conditional Value-at-Risk and spectral risk measures?
Mario Brandtner ... Wolfgang Kürsten
Insurance Mathematics and Economics | VOL. 59
Mario Brandtner, et. al.Mario Brandtner ... Wolfgang Kürsten
02 Oct 2014
Insurance Mathematics and Economics | VOL. 59

Bidding strategy of integrated energy system considering decision maker’s subjective risk aversion
Yangyang Liu ... Feng Yu
Applied energy | VOL. 341
Yangyang Liu, et. al.Yangyang Liu ... Feng Yu
24 Apr 2023
Applied energy | VOL. 341

On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces
Naci Saldi ... Tamás Linder
Mathematics of Operations Research | VOL. 42
Naci Saldi, et. al.Naci Saldi ... Tamás Linder
01 Nov 2017
Mathematics of Operations Research | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Minimizing spectral risk measures applied to Markov decision processes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Mathematical methods of operations research (Heidelberg, Germany)