Neuroevolution strategies for episodic reinforcement learning

Verena Heidrich-Meisner,Christian Igel

doi:10.1016/j.jalgor.2009.04.002

Abstract

Because of their convincing performance, there is a growing interest in using evolutionary algorithms for reinforcement learning. We propose learning of neural network policies by the covariance matrix adaptation evolution strategy (CMA-ES), a randomized variable-metric search algorithm for continuous optimization. We argue that this approach, which we refer to as CMA Neuroevolution Strategy (CMA-NeuroES), is ideally suited for reinforcement learning, in particular because it is based on ranking policies (and therefore robust against noise), efficiently detects correlations between parameters, and infers a search direction from scalar reinforcement signals. We evaluate the CMA-NeuroES on five different (Markovian and non-Markovian) variants of the common pole balancing problem. The results are compared to those described in a recent study covering several RL algorithms, and the CMA-NeuroES shows the overall best performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neuroevolution strategies for episodic reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Journal of Algorithms

Lead the way for us

Journal: Journal of Algorithms	Publication Date: May 8, 2009
Citations: 79

Similar Papers

Design of acoustic metamaterials using the covariance matrix adaptation evolutionary strategy
Bei Huang ... Qiang Cheng
Applied Physics Express | VOL. 10
Bei Huang, et. al.Bei Huang ... Qiang Cheng
31 Jan 2017
Applied Physics Express | VOL. 10

CMA-ES with coordinate selection for high-dimensional and ill-conditioned functions
Hiroki Shimizu ... Masashi Toyoda
-
Hiroki Shimizu, et. al.Hiroki Shimizu ... Masashi Toyoda
07 Jul 2021
07 Jul 2021

Stepping ahead Firefly Algorithm and hybridization with evolution strategy for global optimization problems
Ravneil Nand ... Kaylash Chaudhary
Applied Soft Computing | VOL. 109
Ravneil Nand, et. al.Ravneil Nand ... Kaylash Chaudhary
24 May 2021
Applied Soft Computing | VOL. 109

Covariance Matrix Adaptation for Multi-objective Optimization
Christian Igel ... Nikolaus Hansen
Evolutionary Computation | VOL. 15
Christian Igel, et. al.Christian Igel ... Nikolaus Hansen
01 Mar 2007
Evolutionary Computation | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neuroevolution strategies for episodic reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Journal of Algorithms