Accelerating Deep Reinforcement Learning Using Human Demonstration Data Based on Dual Replay Buffer Management and Online Frame Skipping

Sangho Yeo, Minsu Lee, Sangyoon Oh

doi:10.1109/bigcomp.2019.8679366

Abstract

Human demonstration data plays an important role in the early stage of deep reinforcement learning, both accelerating the training process and guiding a reinforcement learning agent toward complicated policies. However, most current reinforcement learning approaches that use human demonstration data and rewards assume that a sufficient amount of high-quality human demonstration data is available, which is not true for most real-world learning cases, where expert demonstration data is limited. To overcome this limitation, we propose a novel deep reinforcement learning approach with dual replay buffer management and online frame skipping for human demonstration data sampling. The dual replay buffer consists of a human replay memory, an actor replay memory, and a replay manager, and it can manage the two replay buffers with independent sampling policies. We also propose online frame skipping to fully utilize the available human data. During the training period, frame skipping is performed dynamically on the human replay buffer, where all of the human data is stored. Two online frame-skipping methods, namely FS-ER (Frame Skipping-Experience Replay) and DFS-ER (Dynamic Frame Skipping-Experience Replay), are used to sample data from the human replay buffer. We conducted empirical experiments on four popular Atari games, and the results show that our two proposed online frame-skipping methods with dual replay memory outperform existing baselines. Specifically, DFS-ER shows the fastest score increase during the reinforcement learning procedure in three out of four experiments. FS-ER shows the best performance in the remaining environment, which is hard to train on because of sparse rewards.
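The structure described in the abstract (two replay memories plus a manager that samples each with its own policy) can be illustrated with a minimal Python sketch. Everything beyond that structure is an assumption made for illustration, not the authors' method: the class and function names, the `human_ratio` and `skip` parameters, uniform sampling for the actor memory, and the particular skipping rule (merge up to `skip` consecutive stored frames, keep the first action, sum the rewards). The abstract does not say how FS-ER or DFS-ER choose skip lengths, so a single `skip` value stands in here; a dynamic variant would presumably change it during training.

```python
import random
from collections import deque


class ReplayMemory:
    """Fixed-capacity FIFO store of (state, action, reward, next_state, done)."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def __len__(self):
        return len(self.buffer)


def uniform_sample(memory, n, rng):
    """Assumed actor-side policy: draw n transitions uniformly with replacement."""
    return [memory.buffer[rng.randrange(len(memory))] for _ in range(n)]


def frame_skip_sample(memory, n, skip, rng):
    """Assumed human-side policy: merge up to `skip` consecutive stored frames.

    The merged transition keeps the first frame's state and action, sums the
    rewards, and ends at the last merged frame (or earlier at an episode end).
    """
    batch = []
    for _ in range(n):
        start = rng.randrange(len(memory))
        state, action, total, next_state, done = memory.buffer[start]
        index, steps = start + 1, 1
        while steps < skip and not done and index < len(memory):
            _, _, reward, next_state, done = memory.buffer[index]
            total += reward
            index += 1
            steps += 1
        batch.append((state, action, total, next_state, done))
    return batch


class ReplayManager:
    """Builds a mixed batch from the human and actor memories."""

    def __init__(self, human, actor, human_ratio=0.5, skip=4, seed=0):
        self.human = human
        self.actor = actor
        self.human_ratio = human_ratio  # hypothetical mixing parameter
        self.skip = skip                # hypothetical skip length
        self.rng = random.Random(seed)

    def sample(self, batch_size):
        # An empty memory contributes nothing, so the batch may come up short.
        n_human = round(batch_size * self.human_ratio) if len(self.human) else 0
        n_actor = batch_size - n_human if len(self.actor) else 0
        return (frame_skip_sample(self.human, n_human, self.skip, self.rng)
                + uniform_sample(self.actor, n_actor, self.rng))
```

In this sketch the human memory is never modified by skipping: the merge happens at sampling time, so the same stored demonstrations can be read at different skip lengths, which is one plausible reading of "online" frame skipping in the abstract.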
