Abstract

Deep learning techniques have shown success in learning from raw high-dimensional data in various applications. While deep reinforcement learning has recently gained popularity as a method for training intelligent agents, the use of deep learning in imitation learning has been scarcely explored. Imitation learning can be an efficient way to teach intelligent agents by providing a set of demonstrations to learn from. However, generalizing to situations that are not represented in the demonstrations can be challenging, especially in 3D environments. In this paper, we propose a deep imitation learning method for learning navigation tasks from demonstrations in a 3D environment. The supervised policy is refined using active learning in order to generalize to unseen situations. This approach is compared to two popular deep reinforcement learning techniques: deep Q-networks (DQN) and asynchronous advantage actor-critic (A3C). The proposed method, as well as the reinforcement learning methods, employs deep convolutional neural networks and learns directly from raw visual input. Methods for combining learning from demonstrations and learning from experience are also investigated; this combination aims to join the generalization ability of learning by experience with the efficiency of learning by imitation. The proposed methods are evaluated on four navigation tasks in a 3D simulated environment. Navigation tasks are a typical problem relevant to many real applications; they pose the challenge of requiring demonstrations of long trajectories to reach the target while providing only delayed (usually terminal) rewards to the agent. The experiments show that the proposed method can successfully learn navigation tasks from raw visual input, while the learning-from-experience methods fail to learn an effective policy. Moreover, active learning is shown to significantly improve the performance of the initially learned policy using a small number of active samples.
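
As a rough illustration of the approach outlined above, the sketch below shows a small convolutional policy trained by supervised imitation (behavioral cloning) on (observation, action) demonstration pairs and then refined with an uncertainty-based active-learning pass that queries the demonstrator only on low-confidence states. The network shape, the 84x84 input size, the confidence threshold, and the oracle interface are illustrative assumptions, not the authors' exact implementation.

```python
# Minimal sketch (PyTorch assumed): behavioral cloning from raw visual input,
# followed by an uncertainty-driven active-learning query step.
# Architecture, dataset, and threshold are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CNNPolicy(nn.Module):
    """Maps a raw 84x84 RGB observation to logits over discrete navigation actions."""
    def __init__(self, n_actions: int = 4):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 32, 8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, 3, stride=1), nn.ReLU(),
        )
        self.head = nn.Sequential(
            nn.Flatten(), nn.Linear(64 * 7 * 7, 512), nn.ReLU(),
            nn.Linear(512, n_actions),
        )

    def forward(self, obs):
        return self.head(self.conv(obs))  # action logits

def behavioral_cloning(policy, demos, epochs=10, lr=1e-4):
    """Supervised learning on batches of (observation, action) demonstration pairs."""
    opt = torch.optim.Adam(policy.parameters(), lr=lr)
    for _ in range(epochs):
        for obs, act in demos:
            loss = F.cross_entropy(policy(obs), act)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return policy

def active_refinement(policy, states, oracle, threshold=0.6):
    """Query the demonstrator only on states where the policy is uncertain,
    returning new labeled samples to add to the training set."""
    queried = []
    with torch.no_grad():
        for obs in states:
            probs = F.softmax(policy(obs.unsqueeze(0)), dim=-1)
            if probs.max().item() < threshold:  # low confidence -> ask the expert
                queried.append((obs, oracle(obs)))
    return queried

if __name__ == "__main__":
    # Toy usage with synthetic tensors standing in for real demonstrations.
    policy = CNNPolicy(n_actions=4)
    fake_demos = [(torch.randn(8, 3, 84, 84), torch.randint(0, 4, (8,))) for _ in range(5)]
    behavioral_cloning(policy, fake_demos, epochs=1)
    fake_states = [torch.randn(3, 84, 84) for _ in range(16)]
    new_samples = active_refinement(policy, fake_states, oracle=lambda o: torch.tensor(0))
    print(f"queried {len(new_samples)} uncertain states for expert labels")
```

In this sketch the active-learning step plays the role described in the abstract: the initial supervised policy is improved with a small number of additional expert labels collected only where the policy itself is unsure.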

Highlights

  • Recent years have seen a rise in demand for intelligent agents capable of performing complex motor actions

  • The results on the Grid tasks show that, considering static tasks, learning from demonstrations can be successful with far fewer training instances than learning from experience

  • We propose a framework for learning autonomous policies for navigation tasks from demonstrations

Introduction

Recent years have seen a rise in demand for intelligent agents capable of performing complex motor actions. It is difficult to break down and articulate how humans perform tasks in order to program intelligent agents to replicate this behavior. Finding a solution through trial and error may take too long, especially in problems that require performing long trajectories of actions with delayed rewards; the time to learn a policy that maximizes the rewards increases exponentially with the length of such trajectories. Such challenges are present in many real-life applications and pose limitations to current methods. Another drawback is that learning through trial and error may result in a policy that solves the problem differently from how a human would. Performing a task in a manner that is intuitive to a human observer may be crucial in applications where humans and intelligent agents interact together in an environment.
