Evaluating Model-Free Reinforcement Learning toward Safety-Critical Tasks

Linrui Zhang,Bo Yuan,Dacheng Tao,Xueqian Wang,Li Shen,Qin Zhang

doi:10.1609/aaai.v37i12.26786

Abstract

Safety comes first in many real-world applications involving autonomous agents. Despite a large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there is still a lack of high-quality evaluation of those algorithms that adheres to safety constraints at each decision step under complex and unknown dynamics. In this paper, we revisit prior work in this scope from the perspective of state-wise safe RL and categorize them as projection-based, recovery-based, and optimization-based approaches, respectively. Furthermore, we propose Unrolling Safety Layer (USL), a joint method that combines safety optimization and safety projection. This novel technique explicitly enforces hard constraints via the deep unrolling architecture and enjoys structural advantages in navigating the trade-off between reward improvement and constraint satisfaction. To facilitate further research in this area, we reproduce related algorithms in a unified pipeline and incorporate them into SafeRL-Kit, a toolkit that provides off-the-shelf interfaces and evaluation utilities for safety-critical tasks. We then perform a comparative study of the involved algorithms on six benchmarks ranging from robotic control to autonomous driving. The empirical results provide an insight into their applicability and robustness in learning zero-cost-return policies without task-dependent handcrafting. The project page is available at https://sites.google.com/view/saferlkit.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluating Model-Free Reinforcement Learning toward Safety-Critical Tasks

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 6

Similar Papers

SRL-TR2: A Safe Reinforcement Learning Based TRajectory TRacker Framework
Chengyu Wang ... Xiangming Wen
IEEE Transactions on Intelligent Transportation Systems | VOL. 24
Chengyu Wang, et. al.Chengyu Wang ... Xiangming Wen
01 Jun 2023
IEEE Transactions on Intelligent Transportation Systems | VOL. 24

Accelerating Model-Free Reinforcement Learning With Imperfect Model Knowledge in Dynamic Spectrum Access
Lianjun Li ... Yang Yi
IEEE Internet of Things Journal | VOL. 7
Lianjun Li, et. al.Lianjun Li ... Yang Yi
01 Aug 2020
IEEE Internet of Things Journal | VOL. 7

Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems
Cheng Gao ... Dan Wang
Journal of Building Engineering | VOL. 74
Cheng Gao, et. al.Cheng Gao ... Dan Wang
01 Sep 2023
Journal of Building Engineering | VOL. 74

Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on Videos.
Xingxing Wei ... Songping Wang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Xingxing Wei, et. al.Xingxing Wei ... Songping Wang
01 Sep 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluating Model-Free Reinforcement Learning toward Safety-Critical Tasks

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence