Abstract

In recent years, deep reinforcement learning (DRL) has achieved impressive results in many fields. However, existing DRL algorithms usually require a large amount of exploration to obtain a good action policy. In addition, in many complex situations, the reward function cannot be designed well enough to meet the task requirements. These two problems make it difficult for DRL to learn a good action policy within a relatively short period. The use of expert data can provide effective guidance and avoid unnecessary exploration. This study proposes a deep imitation reinforcement learning (DIRL) algorithm that uses a certain amount of expert demonstration data to speed up the training of DRL. In the proposed method, the learning agent imitates the expert's action policy by learning from the demonstration data. After imitation learning, DRL is used to optimise the action policy in a self-learning manner. Experimental comparisons on a Mario racing video game show that the proposed DIRL algorithm with expert demonstration data obtains much better performance than previous DRL algorithms without expert guidance.
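The abstract describes a two-phase scheme: imitation learning from expert demonstrations, followed by self-directed reinforcement learning. The paper's exact architecture and RL algorithm are not given in the abstract, so the sketch below is only a minimal illustration of that general structure, assuming a discrete-action policy network, behavioural cloning for the imitation phase, and a REINFORCE-style update for the self-learning phase; names such as `PolicyNet`, `expert_states`, and the `env` interface are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical policy network; the paper's actual architecture is not
# specified in the abstract.
class PolicyNet(nn.Module):
    def __init__(self, state_dim, n_actions):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, n_actions),
        )

    def forward(self, states):
        return self.net(states)  # action logits

def imitation_pretrain(policy, expert_states, expert_actions,
                       epochs=10, lr=1e-3):
    """Phase 1: behavioural cloning on expert demonstrations.

    expert_states: float tensor of shape (N, state_dim)
    expert_actions: long tensor of expert action indices, shape (N,)
    """
    opt = torch.optim.Adam(policy.parameters(), lr=lr)
    for _ in range(epochs):
        logits = policy(expert_states)
        loss = F.cross_entropy(logits, expert_actions)  # match expert actions
        opt.zero_grad()
        loss.backward()
        opt.step()
    return policy

def reinforce_finetune(policy, env, episodes=100, gamma=0.99, lr=1e-4):
    """Phase 2: self-learning with a policy-gradient (REINFORCE) update.

    `env` is assumed to expose reset() -> state and
    step(action) -> (state, reward, done); adapt to your environment API.
    """
    opt = torch.optim.Adam(policy.parameters(), lr=lr)
    for _ in range(episodes):
        log_probs, rewards = [], []
        state, done = env.reset(), False
        while not done:
            logits = policy(torch.as_tensor(state, dtype=torch.float32))
            dist = torch.distributions.Categorical(logits=logits)
            action = dist.sample()
            state, reward, done = env.step(action.item())
            log_probs.append(dist.log_prob(action))
            rewards.append(reward)
        # Discounted returns, normalised, then the standard REINFORCE loss.
        returns, g = [], 0.0
        for r in reversed(rewards):
            g = r + gamma * g
            returns.insert(0, g)
        returns = torch.tensor(returns)
        returns = (returns - returns.mean()) / (returns.std() + 1e-8)
        loss = -(torch.stack(log_probs) * returns).sum()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return policy
```

In this reading, the pretraining phase gives the agent a reasonable starting policy that avoids the random exploration a cold-started DRL agent would need, and the fine-tuning phase then improves on the expert where the (possibly imperfect) demonstrations fall short.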
