Combining Model-Based Design and Model-Free Policy Optimization to Learn Safe, Stabilizing Controllers

Tyler Westenbroek,Koushil Sreenath,Fernando Castañeda,S Shankar Sastry,Ayush Agrawal

doi:10.1016/j.ifacol.2021.08.468

Abstract

Abstract This paper introduces a framework for learning a safe, stabilizing controller for a system with unknown dynamics using model-free policy optimization algorithms. Using a nominal dynamics model, the user specifies a candidate Control Lyapunov Function (CLF) around the desired operating point, and specifies the desired safe-set using a Control Barrier Function (CBF). Using penalty methods from the optimization literature, we then develop a family of policy optimization problems which attempt to minimize control effort while satisfying the pointwise constraints used to specify the CLF and CBF. We demonstrate that when the penalty terms are scaled correctly, the optimization prioritizes the maintenance of safety over stability, and stability over optimality. We discuss how standard reinforcement learning algorithms can be applied to the problem, and validate the approach through simulation. We then illustrate how the approach can be applied to a class of hybrid models commonly used in the dynamic walking literature, and use it to learn safe, stable walking behavior over a randomly spaced sequence of stepping stones.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combining Model-Based Design and Model-Free Policy Optimization to Learn Safe, Stabilizing Controllers

Abstract

Talk to us

Similar Papers

More From: IFAC-PapersOnLine

Lead the way for us

Journal: IFAC-PapersOnLine	Publication Date: Jan 1, 2021
Citations: 9

Similar Papers

Comparative Analysis of Control Barrier Functions and Artificial Potential Fields for Obstacle Avoidance
Andrew Singletary ... Andrew Browning
-
Andrew Singletary, et. al.Andrew Singletary ... Andrew Browning
27 Sep 2021
27 Sep 2021

Event-Triggered Control for Safety-Critical Systems With Unknown Dynamics
Wei Xiao ... Calin Belta
IEEE Transactions on Automatic Control | VOL. -
Wei Xiao, et. al.Wei Xiao ... Calin Belta
01 Jan 2021
IEEE Transactions on Automatic Control | VOL. -

A Survey on the Control Lyapunov Function and Control Barrier Function for Nonlinear-Affine Control Systems
Boqian Li ... Zheng Yan
IEEE/CAA Journal of Automatica Sinica | VOL. 10
Boqian Li, et. al.Boqian Li ... Zheng Yan
01 Mar 2023
IEEE/CAA Journal of Automatica Sinica | VOL. 10

Heterogeneous optimal formation control of nonlinear multi-agent systems with unknown dynamics by safe reinforcement learning
Fatemeh Mahdavi Golmisheh ... Saeed Shamaghdari
Applied Mathematics and Computation | VOL. 460
Fatemeh Mahdavi Golmisheh, et. al.Fatemeh Mahdavi Golmisheh ... Saeed Shamaghdari
30 Aug 2023
Applied Mathematics and Computation | VOL. 460

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combining Model-Based Design and Model-Free Policy Optimization to Learn Safe, Stabilizing Controllers

Abstract

Talk to us

Similar Papers

More From: IFAC-PapersOnLine