Empirical evaluation of contextual policy search with a comparison-based surrogate model and active covariance matrix adaptation

Alexander Fabisch

doi:10.1145/3319619.3321935

Empirical evaluation of contextual policy search with a comparison-based surrogate model and active covariance matrix adaptation

Alexander Fabisch

Open Access

https://doi.org/10.1145/3319619.3321935

Copy DOI

Publication Date: Jul 13, 2019

Affiliation: German Research Centre for Artificial Intelligence

#Covariance Matrix Adaptation #Active Covariance Matrix Adaptation + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Contextual policy search (CPS) is a class of multi-task reinforcement learning algorithms that is particularly useful for robotic applications. A recent state-of-the-art method is Contextual Covariance Matrix Adaptation Evolution Strategies (C-CMA-ES). It is based on the standard black-box optimization algorithm CMA-ES. There are two useful extensions of CMA-ES that we will transfer to C-CMA-ES and evaluate empirically: ACM-ES, which uses a comparison-based surrogate model, and aCMA-ES, which uses an active update of the covariance matrix. We will show that improvements with these methods can be impressive in terms of sample-efficiency, although this is not relevant any more for the robotic domain.

Full Text