An Iterative Scheme of Safe Reinforcement Learning for Nonlinear Systems via Barrier Certificate Generation

Zhengfeng Yang, Xiaochao Tang, Zhiming Liu, Wang Lin, Zhenbing Zeng, Yidan Zhang, Xia Zeng

doi:10.1007/978-3-030-81685-8_22

Abstract

In this paper, we propose a safe reinforcement learning approach to synthesize deep neural network (DNN) controllers for nonlinear systems subject to safety constraints. The proposed approach employs an iterative scheme where a learner and a verifier interact to synthesize safe DNN controllers. The learner trains a DNN controller via deep reinforcement learning, and the verifier certifies the learned controller by computing a maximal safe initial region and its corresponding barrier certificate, based on polynomial abstraction and bilinear matrix inequalities solving. Compared with the existing verification-in-the-loop synthesis methods, our iterative framework is a sequential synthesis scheme of controllers and barrier certificates, which can learn safe controllers with adaptive barrier certificates rather than user-defined ones. We implement the tool SRLBC and evaluate its performance over a set of benchmark examples. The experimental results demonstrate that our approach efficiently synthesizes safe DNN controllers even for a nonlinear system with dimension up to 12.
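The learner/verifier interaction described above can be illustrated with a deliberately tiny sketch. Everything in it is an assumption made for illustration and is not taken from SRLBC: the scalar system x' = x + u, the linear controller u = -gain * x standing in for a DNN, the `learner` that simply raises the gain instead of running deep reinforcement learning, and the `verifier` that tests quadratic candidates B(x) = c - x^2 instead of using polynomial abstraction and bilinear matrix inequalities. Only the control flow (train, try to certify, repeat) mirrors the scheme in the abstract.

```python
import math
from typing import Optional, Tuple

UNSAFE_BOUND = 2.0  # hypothetical unsafe set: |x| >= 2


def closed_loop(x: float, gain: float) -> float:
    """Toy closed-loop dynamics x' = x + u with u = -gain * x."""
    return x - gain * x


def learner(gain: float, step: float = 0.5) -> float:
    """Stand-in for one RL training round: nudge the controller gain upward."""
    return gain + step


def verifier(gain: float) -> Optional[float]:
    """Try candidates B(x) = c - x^2, largest c first; return c or None.

    A larger c means a larger certified initial region |x| <= sqrt(c).
    """
    for c in (3.9, 3.0, 2.0, 1.0):
        boundary = (math.sqrt(c), -math.sqrt(c))  # zero level set of B
        # dB/dt = -2 * x * x' must be strictly positive where B(x) = 0
        lie_ok = all(-2.0 * x * closed_loop(x, gain) > 0.0 for x in boundary)
        # B < 0 on the whole unsafe set, because x^2 >= UNSAFE_BOUND^2 > c there
        unsafe_ok = c - UNSAFE_BOUND ** 2 < 0.0
        if lie_ok and unsafe_ok:
            return c
    return None


def synthesize(max_rounds: int = 10) -> Tuple[float, float, int]:
    """Alternate learner and verifier until a certificate is found."""
    gain = 0.0
    for round_idx in range(1, max_rounds + 1):
        gain = learner(gain)
        c = verifier(gain)
        if c is not None:
            return gain, c, round_idx
    raise RuntimeError("no certified controller found within max_rounds")


if __name__ == "__main__":
    print(synthesize())
```

In this toy the verifier rejects the first two controllers (gains 0.5 and 1.0 do not push trajectories inward on the level set) and certifies the third, which is the kind of feedback loop the abstract describes at a much larger scale.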

Highlights

The design and synthesis of controllers for dynamical systems is a fundamental problem in the field of control.
We propose a safe reinforcement learning approach via barrier certificate generation to synthesize deep neural network (DNN) controllers, which can guarantee the unbounded-time safety of the closed-loop systems.
We have developed a novel scheme for synthesizing safe controllers for nonlinear systems with control inputs subject to safety constraints.

Summary

Introduction

The design and synthesis of controllers for dynamical systems is a fundamental problem in the field of control. Most existing learning-based approaches lack formal reasoning about the safety of the DNN-controlled dynamical systems produced by the learning process. To guarantee the safety of synthesized DNN controllers, a considerable body of work focuses on the safety verification of DNN-controlled closed-loop systems, which is hard because the closed-loop dynamics are entangled with highly nonlinear DNN expressions. Rather than formally verifying already-synthesized DNN controllers, more recent works propose learning DNN controllers for dynamical systems with safety guarantees [8,39,40]. A verification-in-the-loop DNN controller training algorithm is presented in [8], which integrates an RL framework with user-provided control barrier functions (CBFs) for reward function encoding, combined with SMT-based formal CBF checking. A correctness-by-design method is proposed in [39] that first learns DNN controllers and barrier certificates simultaneously using supervised learning, and then performs posterior formal verification of the barrier certificates via SMT solvers.
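For orientation, one commonly used set of sufficient conditions for a barrier certificate can be written as follows. The notation is assumed here for illustration (closed-loop vector field f with controller k, domain Ψ, initial set Θ, unsafe set X_u), and the exact conditions and sign convention used in the paper may differ.

```latex
\begin{aligned}
&\dot{x} = f(x, k(x)), \qquad x \in \Psi \subseteq \mathbb{R}^n, \\
&\text{(i)}\quad   B(x) \ge 0 \quad \forall x \in \Theta, \\
&\text{(ii)}\quad  B(x) < 0 \quad \forall x \in X_u, \\
&\text{(iii)}\quad \nabla B(x) \cdot f(x, k(x)) > 0 \quad \forall x \in \Psi \ \text{with}\ B(x) = 0 .
\end{aligned}
```

If a continuously differentiable B satisfying (i) to (iii) exists, a trajectory that starts in Θ and remains in Ψ cannot cross the zero level set from B ≥ 0 into B < 0, so it never reaches X_u. This is what gives a safety guarantee without a time bound.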

Publication Date: Jan 1, 2021
Citations: 6
License type: CC BY 4.0

Similar Papers

Safe Reinforcement Learning Algorithm and Its Application in Intelligent Control for CPS
Hengjun Zhao ... Zhiming Liu
International Journal of Software and Informatics | VOL. 12
01 Jan 2021

Safe reinforcement learning for dynamical systems using barrier certificates
Qingye Zhao ... Xuandong Li
Connection Science | VOL. 34
12 Dec 2022

Model-Free Safe Reinforcement Learning Through Neural Barrier Certificate
Yujie Yang ... Yichen Liu
IEEE Robotics and Automation Letters | VOL. 8
01 Mar 2023

Safe deep reinforcement learning in diesel engine emission control.
Armin Norouzi ... Charles Robert Koch
Proceedings of the Institution of Mechanical Engineers, Part I: Journal of Systems and Control Engineering | VOL. 237
17 Feb 2023
