Risk-Averse Decision Making Under Uncertainty

Mohamadreza Ahmadi, Michel D Ingham, Aaron D Ames, Richard M Murray, Ugo Rosolia

doi:10.1109/tac.2023.3264178

Abstract

A large class of problems in decision making under uncertainty can be described via Markov decision processes (MDPs) or partially observable MDPs (POMDPs), with applications in artificial intelligence and operations research, among others. In this paper, we consider the problem of designing policies for MDPs and POMDPs with objectives and constraints in terms of dynamic coherent risk measures rather than the traditional total expectation, which we refer to as the constrained risk-averse problem. Our contributions are as follows. For MDPs, under some mild assumptions, we propose an optimization-based method to synthesize Markovian policies. We then demonstrate that such policies can be found by solving difference convex programs (DCPs). We show that our formulation generalizes linear programs for constrained MDPs with total discounted expected costs and constraints. For POMDPs, we show that, if the coherent risk measures can be defined as a Markov risk transition mapping, an infinite-dimensional optimization can be used to design Markovian belief-based policies. For POMDPs with stochastic finite-state controllers (FSCs), we show that the latter optimization simplifies to a (finite-dimensional) DCP. We incorporate these DCPs in a policy iteration algorithm to design risk-averse FSCs for POMDPs. We demonstrate the efficacy of the proposed method with numerical experiments involving conditional-value-at-risk (CVaR) and entropic-value-at-risk (EVaR) risk measures.
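To make the two risk measures named in the abstract concrete, the following is a minimal sketch that evaluates the expectation, CVaR, and EVaR of a single discrete cost distribution. It is not the paper's method: it does not touch MDPs, POMDPs, or DCPs. It assumes one common convention for a cost variable c with confidence level eps in (0, 1], namely CVaR_eps(c) = inf_z { z + E[(c - z)^+] / eps } and EVaR_eps(c) = inf_{zeta > 0} log(E[exp(zeta c)] / eps) / zeta. The function names, the example costs and probabilities, and the search bounds are all illustrative assumptions.

```python
import math


def expectation(costs, probs):
    """Risk-neutral expected cost of a discrete distribution."""
    return sum(p * c for c, p in zip(costs, probs))


def cvar(costs, probs, eps):
    """CVaR_eps(c) = inf_z { z + E[(c - z)^+] / eps } (assumed convention).

    The objective is convex and piecewise linear in z with breakpoints at
    the support points, so it is enough to evaluate it at each cost value.
    """
    def objective(z):
        tail = sum(p * max(c - z, 0.0) for c, p in zip(costs, probs))
        return z + tail / eps

    return min(objective(z) for z in costs)


def evar(costs, probs, eps, iters=200):
    """EVaR_eps(c) = inf_{zeta>0} log(E[exp(zeta c)] / eps) / zeta (assumed).

    With t = 1 / zeta the objective is convex in t, hence unimodal in
    s = log(t), so a ternary search over s is used. Probabilities are
    assumed strictly positive. The search interval is an arbitrary choice.
    """
    m = max(costs)  # shift by the maximum cost to avoid overflow in exp

    def objective(s):
        t = math.exp(s)
        lse = math.log(sum(p * math.exp((c - m) / t)
                           for c, p in zip(costs, probs)))
        return m + t * (lse - math.log(eps))

    lo, hi = math.log(1e-6), math.log(1e3)
    for _ in range(iters):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if objective(a) < objective(b):
            hi = b
        else:
            lo = a
    return objective(0.5 * (lo + hi))


# Hypothetical cost distribution: a rare but expensive outcome.
costs = [0.0, 1.0, 10.0]
probs = [0.7, 0.2, 0.1]
eps = 0.3

mean_cost = expectation(costs, probs)
cvar_cost = cvar(costs, probs, eps)
evar_cost = evar(costs, probs, eps)
print(mean_cost, cvar_cost, evar_cost)
```

Under this convention the three values are ordered: the expectation is no larger than CVaR, which is no larger than EVaR, which is no larger than the worst-case cost. A smaller eps makes both risk measures more conservative, which is the sense in which a risk-averse policy penalizes rare, costly outcomes more than a risk-neutral one.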

Journal: IEEE Transactions on Automatic Control	Publication Date: Jan 1, 2024
Citations: 2	License type: mit

Similar Papers

Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V Denardo
SIAM Review | VOL. 9
01 Apr 1967

POMDP-based online target detection and recognition for autonomous UAVs
03 Sep 2014

A Bayesian game based adaptive fuzzy controller for multiagent POMDPs
Rajneesh Sharma ... Matthijs T J Spaan
01 Jul 2010

Risk Measures and Nonlinear Expectations
Zengjing Chen ... Kun He
Journal of Mathematical Finance | VOL. 03
01 Jan 2013
