Imitation learning of stabilizing policies for nonlinear systems

Sebastian East

doi:10.1016/j.ejcon.2022.100678

Imitation learning of stabilizing policies for nonlinear systems

Sebastian East

Open Access

https://doi.org/10.1016/j.ejcon.2022.100678

Copy DOI

Journal: European Journal of Control	Publication Date: Jun 15, 2022
Citations: 1	License type: cc-by

Affiliation: University of Bristol

#Sum Of Squares Techniques #Imitation Learning + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

There has been a recent interest in imitation learning methods that are guaranteed to produce a stabilizing control law with respect to a known system. Work in this area has generally considered linear systems and controllers, for which stabilizing imitation learning takes the form of a biconvex optimization problem. In this paper it is demonstrated that the same methods developed for linear systems and controllers can be readily extended to polynomial systems and controllers using sum of squares techniques. A projected gradient descent algorithm and an alternating direction method of multipliers algorithm are proposed as heuristics for solving the stabilizing imitation learning problem, and their performance is illustrated through numerical experiments.

Full Text