Abstract

Learning from demonstration, as an important component of imitation learning, is a paradigm for robot to learn new tasks. Considering the application of learning from demonstration in the navigation issue, the robot can also acquire the navigation task via the human teacher’s demonstration. Based on research of the human brain neocortex, in this article, we present a learning from demonstration navigation paradigm from the perspective of hierarchical temporal memory theory. As a type of end-to-end learning form, the demonstrated relationship between perception data and motion commands will be learned and predicted by using hierarchical temporal memory. This framework first perceives images to obtain the corresponding categories information; then the categories incorporated with depth and motion command data are encoded as a sequence of sparse distributed representation vectors. The sequential vectors are treated as the inputs to train the navigation hierarchical temporal memory. After the training, the navigation hierarchical temporal memory stores the transitions of the perceived images, depth, and motion data so that future motion commands can be predicted. The performance of the proposed navigation strategy is evaluated via the real experiments and the public data sets.

Highlights

  • Learning from demonstration (LfD), as an important issue in imitation learning,[1] is a paradigm for robot to learn new tasks

  • As the human being learns the navigation behavior above, the robot can acquire the navigation task via the human teacher’s demonstration. This natural communication way between the human teacher and robot learner releases the complex couple of perception and planning in the navigation process, and LfD for autonomous navigation has become an attractive topic in robotics area

  • If the anomaly likelihood of any action is above a predefined probability threshold PTh_ano (0.90 in our experiment, i.e. the probability or accuracy of the green section is 90%, which is equivalent to a 1.65s tolerance interval for a normal distribution), we designed a simple action retrieval strategy, that is, recalling the remembered action sequence stored in Nav-Hierarchical temporal memory (HTM) to replace that which has a higher anomaly likelihood

Read more

Summary

Introduction

Learning from demonstration (LfD), as an important issue in imitation learning,[1] is a paradigm for robot to learn new tasks. Considering the application of LfD in the navigation issue, a person follows the tour guide to move from any position to the destination when he first visits an unknown place. After the person remembers the path which the guide showed, he learned the navigation skill on how to go to the destination in that place. As the human being learns the navigation behavior above, the robot can acquire the navigation task via the human teacher’s demonstration. This natural communication way between the human teacher and robot learner releases the complex couple of perception and planning in the navigation process, and LfD for autonomous navigation has become an attractive topic in robotics area

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call