Risk-sensitive planning in partially observable environments

Janusz Marecki ,Pradeep Varakantham

doi:10.5555/1838206.1838384

Abstract

Partially Observable Markov Decision Process (POMDP) is a popular framework for planning under uncertainty in partially observable domains. Yet, the POMDP model is risk-neutral in that it assumes that the agent is maximizing the expected reward of its actions. In contrast, in domains like financial planning, it is often required that the agent decisions are risk-sensitive (maximize the utility of agent actions, for non-linear utility functions). Unfortunately, existing POMDP solvers cannot solve such planning problems exactly. By considering piecewise linear approximations of utility functions, this paper addresses this shortcoming in three contributions: (i) It defines the Risk-Sensitive POMDP model; (ii) It derives the fundamental properties of the underlying value functions and provides a functional value iteration technique to compute them exactly and (c) It proposes an efficient procedure to determine the dominated value functions, to speed up the algorithm. Our experiments show that the proposed approach is feasible and applicable to realistic financial planning domains.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Risk-sensitive planning in partially observable environments

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

POMDP-based online target detection and recognition for autonomous UAVs
...
-
, et. al. ...
03 Sep 2014
03 Sep 2014

Closing the learning-planning loop with predictive state representations
...
-
, et. al. ...
10 May 2010
10 May 2010

A POMDP Based Routing Model to Enhance Directed Diffusion in Wireless Sensor Networks
...
-
, et. al. ...
07 Dec 2013
07 Dec 2013

Relational approach to knowledge engineering for POMDP-based assistance systems as a translation of a psychological model
Marek Grześ ... Andrew Monk
International Journal of Approximate Reasoning | VOL. 55
Marek Grześ, et. al.Marek Grześ ... Andrew Monk
23 Mar 2013
International Journal of Approximate Reasoning | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Risk-sensitive planning in partially observable environments

Abstract

Talk to us

Similar Papers