Abstract

Target-driven visual navigation is essential for many applications in robotics and has gained increasing interest in recent years. In this work, inspired by animal cognitive mechanisms, we propose a novel navigation architecture that simultaneously learns an exploration policy and encodes the structure of the environment. First, to learn the exploration policy directly from raw visual input, we use deep reinforcement learning as the basic framework and allow agents to create rewards for themselves as learning signals. In our approach, the reward for the current observation is driven by curiosity and is computed from a count-based novelty measure and temporal distance. While agents learn the exploration policy, we use temporal distance to find waypoints in the observation sequence and incrementally describe the structure of the environment in a way that integrates episodic memory. Finally, space topological cognition is integrated into the model as a path-planning module and combined with a locomotion network to obtain a more generalized approach to navigation. We test our approach in DeepMind Lab (DMLab), a visually rich 3D environment, and validate its exploration efficiency and navigation performance through extensive experiments. The experimental results show that our approach explores and encodes the environment more efficiently and copes better with stochastic objects. In navigation tasks, agents can use space topological cognition to reach the target effectively and to guide detour behaviour when a path is unavailable, exhibiting good environmental adaptability.
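To make the reward computation described above concrete, the following is a minimal sketch, assuming an inverse-square-root count-based bonus and a thresholded temporal-distance bonus combined with weights α and β (α + β = 1). The function names, functional forms, and default parameters are illustrative assumptions, not the paper's exact formulation.

```python
import math
from collections import defaultdict

# Hypothetical sketch of a curiosity-driven intrinsic reward that mixes a
# count-based novelty bonus with a temporal-distance bonus (alpha + beta = 1).
# The functional forms below are illustrative assumptions only.

state_counts = defaultdict(int)  # visit counts over (discretised) observations

def count_based_bonus(state_key):
    """Novelty bonus that decays as a state is visited more often."""
    state_counts[state_key] += 1
    return 1.0 / math.sqrt(state_counts[state_key])

def temporal_distance_bonus(distance, threshold=5):
    """Reward observations that are temporally far from stored memory.

    `distance` would come from a learned network estimating how many steps
    separate the current observation from previously stored ones.
    """
    return 1.0 if distance > threshold else 0.0

def intrinsic_reward(state_key, distance, alpha=0.5, beta=0.5):
    """Weighted combination of the two novelty signals (alpha + beta = 1)."""
    return alpha * count_based_bonus(state_key) + beta * temporal_distance_bonus(distance)
```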

Highlights

  • To weigh the relative effects of the two novelty terms, we test different parameter settings that satisfy α + β = 1, sampled at intervals of 0.1, and report two main results: the episode reward (the novelty reward collected by the agent within 1800 time steps) and the number of interactions required to encode the environment

  • The Deep Recurrent Q-Network (DRQN) model is equipped with a long short-term memory (LSTM) unit and compensates for the memory deficit of the DQN: it remembers the target location and returns to it as many times as possible within an episode, but it requires a large number of time steps to find the target for the first time (a minimal sketch of such a recurrent Q-network follows this list)

  • We propose a novel navigation architecture consisting of intrinsic-motivation exploration and space topological cognition
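For reference, the recurrent baseline mentioned in the second bullet can be sketched as a convolutional encoder feeding an LSTM whose hidden state carries memory across time steps, followed by a linear Q-value head. This is a minimal sketch assuming an 84x84 RGB input and standard DQN-style layer sizes, not the exact configuration used in the experiments.

```python
import torch
import torch.nn as nn

# Minimal DRQN-style sketch: a convolutional encoder followed by an LSTM
# whose hidden state carries memory across time steps. Layer sizes and the
# 84x84 RGB input are illustrative assumptions.

class DRQN(nn.Module):
    def __init__(self, num_actions, hidden_size=256):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        self.lstm = nn.LSTM(input_size=64 * 7 * 7, hidden_size=hidden_size,
                            batch_first=True)
        self.q_head = nn.Linear(hidden_size, num_actions)

    def forward(self, obs_seq, hidden=None):
        # obs_seq: (batch, time, 3, 84, 84)
        b, t = obs_seq.shape[:2]
        feats = self.encoder(obs_seq.view(b * t, *obs_seq.shape[2:]))
        feats = feats.view(b, t, -1)
        out, hidden = self.lstm(feats, hidden)
        return self.q_head(out), hidden  # Q-values for each time step

# Usage: q_values, hidden = DRQN(num_actions=8)(torch.zeros(1, 4, 3, 84, 84))
```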


