PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning

Dor Livne,Kobi Cohen

doi:10.1109/jstsp.2020.2967566

Abstract

The recent success of deep neural networks (DNNs) for function approximation in reinforcement learning has triggered the development of Deep Reinforcement Learning (DRL) algorithms in various fields, such as robotics, computer games, natural language processing, computer vision, sensing systems, and wireless networking. Unfortunately, DNNs suffer from high computational cost and memory consumption, which limits the use of DRL algorithms in systems with limited hardware resources. In recent years, pruning algorithms have demonstrated considerable success in reducing the redundancy of DNNs in classification tasks. However, existing algorithms suffer from a significant performance reduction in the DRL domain. In this article, we develop the first effective solution to the performance reduction problem of pruning in the DRL domain, and establish a working algorithm, named Policy Pruning and Shrinking (PoPS), to train DRL models with strong performance while achieving a compact representation of the DNN. The framework is based on a novel iterative policy pruning and shrinking method that leverages the power of transfer learning when training the DRL model. We present an extensive experimental study that demonstrates the strong performance of PoPS using the popular Cartpole, Lunar Lander, Pong, and Pacman environments. Finally, we develop an open source software for the benefit of researchers and developers in related fields.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Selected Topics in Signal Processing

Lead the way for us

Journal: IEEE Journal of Selected Topics in Signal Processing	Publication Date: May 1, 2020
Citations: 42

Similar Papers

Space Manipulator Assembly Operation Technique based on Deep Residual Reinforcement Learning
Kui Huang ... Junyu Quan
Journal of Physics: Conference Series | VOL. 2405
Kui Huang, et. al.Kui Huang ... Junyu Quan
01 Dec 2022
Journal of Physics: Conference Series | VOL. 2405

What can classic Atari video games tell us about the human brain?
Raphael Köster ... Martin J Chadwick
Neuron | VOL. 109
Raphael Köster, et. al.Raphael Köster ... Martin J Chadwick
01 Feb 2021
Neuron | VOL. 109

DDPG Agent to Swing Up and Balance Cart- Pole System
Buvanesh Pandian V
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Buvanesh Pandian VBuvanesh Pandian V
09 Apr 2021
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Collision-avoidance under COLREGS for unmanned surface vehicles via deep reinforcement learning
Yong Ma ... Yuanzhou Zheng
Maritime Policy & Management | VOL. 47
Yong Ma, et. al.Yong Ma ... Yuanzhou Zheng
12 May 2020
Maritime Policy & Management | VOL. 47

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Selected Topics in Signal Processing