Planning for Risk-Aversion and Expected Value in MDPs

Marc Rigter, Nick Hawes, Bruno Lacerda, Paul Duckworth

doi:10.1609/icaps.v32i1.19814

Abstract

Planning in Markov decision processes (MDPs) typically optimises the expected cost. However, optimising the expectation does not account for the risk that, on any given run of the MDP, the total cost received may be unacceptably high. An alternative approach is to find a policy which optimises a risk-averse objective such as conditional value at risk (CVaR). However, optimising the CVaR alone may result in poor performance in expectation. In this work, we begin by showing that there can be multiple policies which obtain the optimal CVaR. This motivates us to propose a lexicographic approach which minimises the expected cost subject to the constraint that the CVaR of the total cost is optimal. We present an algorithm for this problem and evaluate our approach on four domains. Our results demonstrate that our lexicographic approach improves the expected cost compared to the state-of-the-art algorithm, while achieving the optimal CVaR.
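
The lexicographic objective described in the abstract can be illustrated with a minimal sketch. This is not the algorithm from the paper: it assumes each candidate policy is summarised by a hand-specified discrete distribution over total cost, and it assumes the convention that CVaR at level alpha is the mean of the worst (highest-cost) alpha-fraction of outcomes. The policy names, cost values, and the helper functions cvar, expected_cost and lexicographic_choice are all hypothetical and chosen only for illustration.

```python
def cvar(costs, probs, alpha):
    """CVaR at level alpha of a discrete cost distribution.

    Assumed convention: the mean of the worst alpha-fraction of
    outcomes, where "worst" means highest cost.
    """
    remaining = alpha
    total = 0.0
    # Walk through outcomes from highest cost to lowest, consuming
    # probability mass until alpha of it has been used.
    for cost, p in sorted(zip(costs, probs), reverse=True):
        take = min(p, remaining)
        total += take * cost
        remaining -= take
        if remaining <= 0:
            break
    return total / alpha


def expected_cost(costs, probs):
    """Expected total cost of a discrete cost distribution."""
    return sum(c * p for c, p in zip(costs, probs))


def lexicographic_choice(policies, alpha, tol=1e-9):
    """Among policies with (near-)optimal CVaR, pick the lowest expected cost."""
    best_cvar = min(cvar(c, p, alpha) for c, p in policies.values())
    candidates = {
        name: expected_cost(c, p)
        for name, (c, p) in policies.items()
        if cvar(c, p, alpha) <= best_cvar + tol
    }
    return min(candidates, key=candidates.get)


# Hypothetical total-cost distributions: (costs, probabilities).
policies = {
    "A": ([10.0, 2.0], [0.25, 0.75]),  # CVaR 10, expected cost 4.0
    "B": ([10.0, 6.0], [0.25, 0.75]),  # CVaR 10, expected cost 7.0
    "C": ([12.0, 1.0], [0.25, 0.75]),  # CVaR 12, expected cost 3.75
}

chosen = lexicographic_choice(policies, alpha=0.25)
print(chosen)
```

In this toy setting, policies A and B share the optimal CVaR at alpha = 0.25, so a CVaR-only criterion cannot distinguish them, while an expectation-only criterion would prefer C despite its worse tail. The lexicographic rule selects A.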

Journal: Proceedings of the International Conference on Automated Planning and Scheduling
Publication Date: Jun 13, 2022
Citations: 4

Similar Papers

Risk aversion in finite Markov Decision Processes using total cost criteria and average value at risk
Stefano Carpin ... Yin-Lam Chow
01 May 2016

Economically Efficient Power Storage Operation by Dealing with the Non-Normality of Power Prediction
Shiro Yano ... Tadahiro Taniguchi
Energies | VOL. 8
27 Oct 2015

A Markov Decision Model for a Surveillance Application and Risk-Sensitive Markov Decision Processes
01 Jan 2009

Risk-Aware Optimization of Age of Information in the Internet of Things
Bo Zhou ... Walid Saad
01 Jun 2020
