GoSafeOpt: Scalable safe exploration for global optimization of dynamical systems

Bhavya Sukhija,Matteo Turchetta,David Lindner,Andreas Krause,Sebastian Trimpe,Dominik Baumann

doi:10.1016/j.artint.2023.103922

GoSafeOpt: Scalable safe exploration for global optimization of dynamical systems

Bhavya Sukhija, Matteo Turchetta + Show 4 more

Open Access

https://doi.org/10.1016/j.artint.2023.103922

Copy DOI

Journal: Artificial Intelligence	Publication Date: Apr 20, 2023
Citations: 1	License type: cc-by

Affiliation: Uppsala University, Aalto University, ETH Zurich, RWTH Aachen University

#High-dimensional State Space #Model-free Methods + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Learning optimal control policies directly on physical systems is challenging. Even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. This work proposes GoSafeOpt as the first provably safe and optimal algorithm that can safely discover globally optimal policies for systems with high-dimensional state space. We demonstrate the superiority of GoSafeOpt over competing model-free safe learning methods in simulation and hardware experiments on a robot arm.

Full Text