Safe deployment of a reinforcement learning robot using self stabilization

Nanda Kishore Sreenivas,Shrisha Rao

doi:10.1016/j.iswa.2022.200105

Abstract

• In RL applications, there is no guarantee that a robot will operate within the same state space in which it was trained. • This paper focuses on ensuring the safe deployment of a robot. • This work defines a condition on the state and action spaces, that if satisfied, guarantees safe recovery. • We also propose a strategy and design that facilitate this recovery within a finite number of steps after perturbation. In toy environments like video games, a reinforcement learning agent is deployed and operates within the same state space in which it was trained. However, in robotics applications such as industrial systems or autonomous vehicles, this cannot be guaranteed. A robot can be pushed out of its training space by some unforeseen perturbation, which may cause it to go into an unknown state from which it has not been trained to move towards its goal. While most prior work in the area of RL safety focuses on ensuring safety in the training phase, this paper focuses on ensuring the safe deployment of a robot that has already been trained to operate within a safe space. This work defines a condition on the state and action spaces, that if satisfied, guarantees the robot’s recovery to safety independently. We also propose a strategy and design that facilitate this recovery within a finite number of steps after perturbation. This is implemented and tested against a standard RL model, and the results indicate a significant improvement in performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Safe deployment of a reinforcement learning robot using self stabilization

Abstract

Talk to us

Similar Papers

More From: Intelligent Systems with Applications

Lead the way for us

Journal: Intelligent Systems with Applications	Publication Date: Nov 1, 2022
License type: cc-by-nc-nd

Similar Papers

Adaptive co-construction of state and action spaces in reinforcement learning
Masato Nagayoshi ... Hajime Murao
Artificial Life and Robotics | VOL. 16
Masato Nagayoshi, et. al.Masato Nagayoshi ... Hajime Murao
01 Jun 2011
Artificial Life and Robotics | VOL. 16

Duality in Markov Decision Problems with Countable Action and State Spaces
John P Evans
Management Science | VOL. 15
John P EvansJohn P Evans
01 Jul 1969
Management Science | VOL. 15

Deep Reinforcement Learning with a Natural Language Action Space
Ji He ... Lihong Li
-
Ji He, et. al.Ji He ... Lihong Li
01 Jan 2015
01 Jan 2015

Learning to control a joint driven double inverted pendulum using nested actor/critic algorithm
N Kobori ... P Hartono
-
N Kobori, et. al.N Kobori ... P Hartono
01 Jan 2002
01 Jan 2002

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Safe deployment of a reinforcement learning robot using self stabilization

Abstract

Talk to us

Similar Papers

More From: Intelligent Systems with Applications