Model-Free Safe Reinforcement Learning Through Neural Barrier Certificate

Yujie Yang, Yuxuan Jiang, Jianyu Chen, Shengbo Eben Li, Yichen Liu

doi:10.1109/lra.2023.3238656

Abstract

Safety is a critical concern when applying reinforcement learning (RL) to real-world control tasks. However, existing safe RL works either consider only expected safety constraint violations and fail to maintain safety guarantees, or use overly conservative safety certificate tools borrowed from safe control theory, which sacrifice reward optimization and rely on analytic system models. This letter proposes a model-free safe RL algorithm that achieves near-zero constraint violations with high rewards. Our key idea is to jointly learn a policy and a neural barrier certificate under a stepwise state constraint setting. The barrier certificate is learned in a model-free manner by minimizing the violations of appropriate barrier properties on transition data collected by the policy. We extend the single-step invariant property of the barrier certificate to a multi-step version and construct the corresponding multi-step invariant loss. This loss balances the bias and variance of the barrier certificate and enhances both the safety and performance of the policy. The policy is optimized under the constraint of the multi-step invariant property using the Lagrangian method. We optimize the policy in a model-free manner by introducing an importance sampling weight in the constraint. We test our algorithm on multiple problems, including classic control tasks, robot collision avoidance, and autonomous driving. Results show that our algorithm achieves near-zero constraint violations and high performance compared to the baselines. Moreover, the learned barrier certificates successfully identify the feasible regions on multiple tasks.
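
To make the idea of "minimizing the violations of barrier properties on transition data" concrete, here is a minimal Python sketch. It is not the loss from the letter: the abstract gives no formulas, so everything below is an assumption chosen for illustration. It assumes a sign convention in which B(x) <= 0 marks certified states and B(x) > 0 is required on constraint-violating states, and a k-step invariant condition of the form B(x_{t+k}) - (1 - decay)^k * B(x_t) <= 0 averaged over k = 1..horizon. The names `barrier_losses`, `is_unsafe`, `decay`, `margin`, and `horizon` are hypothetical, and the barrier is any callable rather than a neural network.

```python
from typing import Callable, Dict, List, Sequence

State = Sequence[float]


def hinge(value: float) -> float:
    """Penalize only positive values, i.e. violations of 'value <= 0'."""
    return max(value, 0.0)


def mean(terms: List[float]) -> float:
    """Average of the terms, or 0.0 when there is nothing to penalize."""
    return sum(terms) / len(terms) if terms else 0.0


def barrier_losses(
    barrier: Callable[[State], float],
    trajectory: Sequence[State],
    is_unsafe: Callable[[State], bool],
    horizon: int = 3,      # assumed number of look-ahead steps
    decay: float = 0.1,    # assumed contraction rate in the invariant condition
    margin: float = 1e-3,  # assumed positive margin on unsafe states
) -> Dict[str, float]:
    """Hinge penalties for two assumed barrier properties on one trajectory."""
    values = [barrier(x) for x in trajectory]

    # Property 1 (assumed): B(x) >= margin on constraint-violating states.
    unsafe_terms = [
        hinge(margin - b) for x, b in zip(trajectory, values) if is_unsafe(x)
    ]

    # Property 2 (assumed): B(x_{t+k}) <= (1 - decay)^k * B(x_t) for each k.
    invariant_terms = []
    for k in range(1, horizon + 1):
        for t in range(len(values) - k):
            target = (1.0 - decay) ** k * values[t]
            invariant_terms.append(hinge(values[t + k] - target))

    return {"unsafe": mean(unsafe_terms), "invariant": mean(invariant_terms)}
```

With a toy barrier such as `lambda x: x[0] - 1.0`, a trajectory that drifts away from the boundary incurs zero penalty, while one that jumps from a certified state into a violating state incurs a positive invariant penalty. Using only observed state sequences, with no dynamics model, is what makes a loss of this shape model-free; a longer `horizon` uses more real transitions per term, which is one plausible reading of the bias and variance trade-off mentioned in the abstract.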

Journal: IEEE Robotics and Automation Letters	Publication Date: Mar 1, 2023
Citations: 13

Similar Papers

Safe reinforcement learning for dynamical systems using barrier certificates
Qingye Zhao ... Xuandong Li
Connection Science | VOL. 34 | 12 Dec 2022

An Iterative Scheme of Safe Reinforcement Learning for Nonlinear Systems via Barrier Certificate Generation
Zhengfeng Yang ... Xia Zeng
01 Jan 2020

Accelerating Model-Free Reinforcement Learning With Imperfect Model Knowledge in Dynamic Spectrum Access
Lianjun Li ... Hao-Hsuan Chang
IEEE Internet of Things Journal | VOL. 7 | 01 Aug 2020

Research and Application of Safe Reinforcement Learning in Power System
Jian Li ... Xinying Wang
01 Apr 2023
