Enhanced Reinforcement Learning with Targeted Dropout

Mark Jovic A Daday,Kristoffer Franz Mari R Millado

doi:10.1109/icd47981.2019.9105750

Abstract

In modern ages, the study on Reinforcement Learning (RL) has driven on Deep Q-Network (DQN) optimization learning prediction and control of Markov decision processes (MDPs). In this paper, the researcher used the Targeted Dropout strategy for RLs DQN that makes straight into learning and would be necessary to deal with MDPs with huge or continuous state and action spaces. Every weight/unit update, the targeted dropout selects a set of elements and to keep only the weights/units of maximum amount, and then apply dropout to the set. It has also a common pruning strategy so focus on fast approximations, such as removing weights with the smallest value or ranking the weights/units by the sensitivity of the network design and even rating by the sensitivity of the task execution with respect to the weights/units and removing the least-sensitive ones. The result shows that the proposed algorithm for enhancing the RL's DQN is more accurate in finding the best action to learn to achieve maximum reward. The simulation presents that in a minimal run of episodes it can achieve the maximum average reward, while without Targeted Dropout it takes more runs to achieve the average reward, and throughout the assessment of the algorithm, the suggested algorithm acquires more learning in finding the large reward value.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhanced Reinforcement Learning with Targeted Dropout

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Reinforcement learning algorithms with function approximation: Recent advances and applications
Xin Xu ... Zhenhua Huang
Information Sciences | VOL. 261
Xin Xu, et. al.Xin Xu ... Zhenhua Huang
05 Sep 2013
Information Sciences | VOL. 261

Continuous-action reinforcement learning with fast policy search and adaptive basis function selection
Xin Xu ... Dewen Hu
Soft Computing | VOL. 15
Xin Xu, et. al.Xin Xu ... Dewen Hu
28 Mar 2010
Soft Computing | VOL. 15

Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning
Enrico Anderlini ... Gordon G Parker
Applied Sciences | VOL. 9
Enrico Anderlini, et. al.Enrico Anderlini ... Gordon G Parker
21 Aug 2019
Applied Sciences | VOL. 9

Learning Agents with Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar ... Gopalakrishnan Srinivasaraghavan
-
Rajesh Mangannavar, et. al.Rajesh Mangannavar ... Gopalakrishnan Srinivasaraghavan
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhanced Reinforcement Learning with Targeted Dropout

Abstract

Talk to us

Similar Papers