Proximal evolutionary strategy: improving deep reinforcement learning through evolutionary policy optimization

Yiming Peng, Gang Chen, Mengjie Zhang, Bing Xue

doi:10.1007/s12293-024-00419-1

Abstract

Evolutionary Algorithms (EAs), including Evolutionary Strategies (ES) and Genetic Algorithms (GAs), have been widely accepted as competitive alternatives to Policy Gradient techniques for Deep Reinforcement Learning (DRL). However, they remain eclipsed by cutting-edge DRL algorithms in terms of time efficiency, sample complexity, and learning effectiveness. In this paper, aiming to advance evolutionary DRL research, we develop an evolutionary policy optimization algorithm with three key technical improvements. First, we design an efficient layer-wise strategy for training deep neural networks (DNNs) through Covariance Matrix Adaptation Evolutionary Strategies (CMA-ES) in a highly scalable manner. Second, we establish a surrogate model based on a proximal performance lower bound for fitness evaluations with low sample complexity. Third, we embed a gradient-based local search technique within the evolutionary policy optimization process to further improve the learning effectiveness. Together, the three technical innovations form a new EA-based DRL method named Proximal Evolutionary Strategies (PES). Our experiments on ten continuous control problems show three results. PES with layer-wise training can be more computationally efficient than CMA-ES. Our surrogate model can remarkably reduce the sample complexity of PES in comparison to the latest EAs for DRL, including CMA-ES, OpenAI-ES, and Uber-GA. PES with gradient-based local search can significantly outperform several promising DRL algorithms, including TRPO, ACKTR, PPO, OpenAI-ES, and Uber-GA.
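
The abstract does not spell out how the layer-wise strategy works, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' PES algorithm. It replaces CMA-ES with plain elitist Gaussian perturbation (no covariance adaptation), uses a toy quadratic in place of an episode return, and searches one layer per generation so that each search step only touches that layer's weights. The names fitness and layerwise_es, and all parameter values, are hypothetical.

```python
import random


def fitness(layers):
    # Toy stand-in for an episode return (an assumption, not from the paper):
    # higher is better, with a maximum of 0 when every weight equals 1.
    return -sum((w - 1.0) ** 2 for layer in layers for w in layer)


def layerwise_es(layers, generations=30, popsize=8, sigma=0.3, seed=0):
    # Simplified layer-wise evolutionary search (hypothetical helper).
    # Each generation perturbs a single layer and leaves the others fixed.
    rng = random.Random(seed)
    layers = [list(layer) for layer in layers]  # work on a copy
    best = fitness(layers)
    for gen in range(generations):
        k = gen % len(layers)  # index of the layer searched this generation
        for _ in range(popsize):
            # Sample a candidate for layer k around the current best weights.
            candidate = [w + rng.gauss(0.0, sigma) for w in layers[k]]
            trial = layers[:k] + [candidate] + layers[k + 1:]
            score = fitness(trial)
            if score > best:  # elitist acceptance: keep only improvements
                layers, best = trial, score
    return layers, best


initial = [[0.0] * 4, [0.0] * 3, [0.0] * 2]  # three "layers" of weights
trained, best = layerwise_es(initial)
print(fitness(initial), best)
```

Because only improvements are accepted, the returned fitness is never worse than the starting fitness. In a full CMA-ES setting, the covariance matrix grows quadratically with the number of searched parameters, which is one plausible reason a per-layer search would scale better than searching all weights at once.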

Journal: Memetic Computing
Publication Date: Aug 17, 2024
License type: CC BY
