Safe reinforcement learning for affine nonlinear systems with state constraints and input saturation using control barrier functions

Shihan Liu,Lijun Liu,Zhen Yu

doi:10.1016/j.neucom.2022.11.006

Shihan Liu, Lijun Liu + Show 1 more

https://doi.org/10.1016/j.neucom.2022.11.006

Copy DOI

Export

Save

Cite

Journal: Neurocomputing	Publication Date: Nov 7, 2022
Citations: 5

Affiliation: Xiamen University

Abstract
Full-Text
Similar Papers

Abstract

Listen

This paper provides a novel safe reinforcement learning (RL) control algorithm to solve safe optimal problems for discrete-time affine nonlinear systems, while the safety and convergence of the control algorithm are proven. The algorithm is proposed based on an adjusted policy iteration (PI) framework using only the measured data along the system trajectories in the environment. The adjusted PI algorithm combines with the system predictive information. Unlike most PI algorithms, an effective method of obtaining an initial safe and stable control policy is given here. In addition, control barrier functions (CBFs) and an input constraint function are introduced to augment reward functions. And the monotonically nonincreasing property of the iterative value function maintains the safe set forward invariant in the PI framework. Moreover, the safety and convergence of the proposed algorithm are proven in theory. Then, the design and implementation of the proposed algorithm are presented based on the identifier-actor-critic structure, where neural networks are employed to approximate the system dynamics, the iterative control policy, and the iterative value function, respectively. Finally, the simulation results illustrate the effectiveness and safety of the proposed algorithm.

Full Text