Abstract

Evolution Strategies (ES), a class of black-box optimization algorithms, has recently been demonstrated to be a viable alternative to popular MDP-based RL techniques such as Q-learning and Policy Gradients. ES achieves fairly good performance on challenging reinforcement learning problems and is easier to scale in a distributed setting. However, standard ES algorithms perform only one gradient update per data sample, which is not very efficient. In this paper, with the goal of using sampled data more efficiently, we propose a novel iterative procedure that optimizes a surrogate objective function, enabling data samples to be reused for multiple epochs of updates. We prove a monotonic improvement guarantee for this procedure. By making several approximations to the theoretically justified procedure, we further develop a practical algorithm called Trust Region Evolution Strategies (TRES). Our experiments demonstrate the effectiveness of TRES on a range of popular MuJoCo locomotion tasks in the OpenAI Gym, where it achieves better performance than the ES algorithm.
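
For context, below is a minimal sketch of the standard (vanilla) ES update that the abstract refers to, in which each batch of sampled perturbations is used for exactly one gradient step; this is the inefficiency TRES aims to address. The function names, hyperparameters, and toy objective are illustrative assumptions, not taken from the paper.

    import numpy as np

    def es_gradient_step(theta, reward_fn, n_samples=50, sigma=0.1, lr=0.01, rng=None):
        """One vanilla-ES update: the sampled perturbations are consumed by a
        single gradient step and then discarded (illustrative sketch)."""
        rng = np.random.default_rng() if rng is None else rng
        # Sample Gaussian perturbations of the policy parameters.
        eps = rng.standard_normal((n_samples, theta.shape[0]))
        # Evaluate the black-box return of each perturbed parameter vector.
        returns = np.array([reward_fn(theta + sigma * e) for e in eps])
        # Standardize returns to reduce the variance of the estimate.
        adv = (returns - returns.mean()) / (returns.std() + 1e-8)
        # Monte Carlo estimate of the gradient of the Gaussian-smoothed
        # objective E_{eps ~ N(0, I)}[ R(theta + sigma * eps) ].
        grad = (eps.T @ adv) / (n_samples * sigma)
        return theta + lr * grad

    # Illustrative usage on a toy quadratic "return" (hypothetical).
    if __name__ == "__main__":
        theta = np.zeros(5)
        target = np.arange(5, dtype=float)
        reward = lambda th: -np.sum((th - target) ** 2)
        for _ in range(200):
            theta = es_gradient_step(theta, reward)
        print(theta)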
