Abstract
Reinforcement learning (RL) has been used to successfully solve sequential decision problems. However, accounting for risk during the learning process remains an open research problem. In this work, we are interested in the type of risk that can lead to a catastrophic state. Related works that aim to deal with risk propose complex models. In contrast, we follow a simple yet effective idea: similar states might lead to similar risk. Using this idea, we propose risk mapping by similarity (RMS), an algorithm for discrete scenarios that infers the risk of newly discovered states by analyzing how similar they are to previously known risky states. In general terms, the RMS algorithm transfers the knowledge the agent has gathered about risk to newly discovered states. We contribute a new approach to considering risk based on similarity, and RMS itself, which is simple and generalizable as long as the premise that similar states yield similar risk holds. RMS is not an RL algorithm but a method for generating a risk-aware reward shaping signal that can be combined with an RL algorithm to produce risk-aware policies.
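The core idea can be illustrated with a minimal sketch. The abstract does not specify the similarity measure or shaping formula, so the Gaussian kernel, the similarity-weighted risk average, and the `penalty` coefficient below are all illustrative assumptions, not the paper's actual method:

```python
import numpy as np

def similarity(s, t):
    # Illustrative choice: Gaussian kernel on state feature vectors.
    return np.exp(-np.sum((np.asarray(s, float) - np.asarray(t, float)) ** 2))

def infer_risk(new_state, known_states, known_risks):
    # Transfer risk knowledge: similarity-weighted average of the risk
    # of previously visited states ("similar states yield similar risk").
    w = np.array([similarity(new_state, s) for s in known_states])
    if w.sum() == 0.0:
        return 0.0
    return float(np.dot(w, known_risks) / w.sum())

def shaped_reward(reward, risk, penalty=1.0):
    # Risk-aware shaping signal: subtract a penalty proportional to the
    # inferred risk from the environment reward.
    return reward - penalty * risk

# Hypothetical example: one state known to be near a catastrophe (risk 1.0)
# and one known to be safe (risk 0.0).
known_states = [(0.0, 0.0), (5.0, 5.0)]
known_risks = [1.0, 0.0]

risk = infer_risk((0.1, 0.1), known_states, known_risks)  # near the risky state
r = shaped_reward(1.0, risk, penalty=0.5)
```

Any RL algorithm can then be trained on `shaped_reward` instead of the raw reward, which is how the shaping signal stays decoupled from the learner.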