Abstract

This study proposed 3D path planning for an autonomous underwater vehicle (AUV) using a hierarchical deep Q network (HDQN) combined with prioritized experience replay. The path planning task was divided into three layers, which reduced the dimensionality of the state space and mitigated the curse of dimensionality. An artificial potential field was used to design the positive rewards of the algorithm, shortening the training time. According to the requirements of different tasks, this study modified the rewards during training to obtain different paths. Path planning simulations and field tests were carried out. The test results corroborated that the training time of the proposed method was shorter than that of the traditional method, and the path obtained by simulation training proved to be safe and effective.
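Prioritized experience replay, as combined with the HDQN here, replays transitions with large temporal-difference (TD) error more often than uniform sampling would. A minimal proportional-prioritization sketch is shown below; the class name, buffer capacity, `alpha` exponent, and transition format are illustrative assumptions, not the authors' implementation (importance-sampling weight correction is omitted for brevity).

```python
import random


class PrioritizedReplayBuffer:
    """Minimal proportional prioritized experience replay (illustrative sketch).

    Transitions are sampled with probability proportional to
    priority**alpha, so experiences with large TD error are
    replayed more often than under uniform replay.
    """

    def __init__(self, capacity, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha
        self.buffer = []      # stored transitions
        self.priorities = []  # one priority per transition
        self.pos = 0          # ring-buffer write index

    def add(self, transition, td_error=1.0):
        # Small epsilon keeps zero-error transitions sampleable.
        priority = (abs(td_error) + 1e-6) ** self.alpha
        if len(self.buffer) < self.capacity:
            self.buffer.append(transition)
            self.priorities.append(priority)
        else:
            # Overwrite the oldest transition once the buffer is full.
            self.buffer[self.pos] = transition
            self.priorities[self.pos] = priority
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        # random.choices draws with replacement, weighted by priority.
        return random.choices(self.buffer, weights=self.priorities, k=batch_size)
```

In a full training loop, the priorities of the sampled transitions would be refreshed with their new TD errors after each gradient step.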

Highlights

  • As a key technology in the marine industry, autonomous underwater vehicles (AUVs) have been given considerable attention and application [1]

  • In view of the existing problems in the present studies, this study proposed an improved hierarchical deep Q network (HDQN) method with the prioritized experience replay to realize the three-dimensional path planning of AUV

  • After launching an AUV, the path nodes obtained by global path planning are transmitted to the lower computer by radio [16]


Summary

Introduction

As a key technology in the marine industry, autonomous underwater vehicles (AUVs) have received considerable attention and application [1]. Petres [7] designed a continuous-state representation and used an anisotropic fast marching algorithm to complete the AUV path planning task; this method only used a linear evaluation function, which had certain limitations. Hiroshi et al. [11] proposed a multi-layer training structure based on Q-learning and carried out a planning simulation experiment on the R-ONE vehicle. Cheng et al. [14] proposed a motion planning method based on deep reinforcement learning (DRL), using a convolutional neural network (CNN) to extract features from sensor information in order to make motion decisions. In view of the problems in the existing studies, this study proposed an improved hierarchical deep Q network (HDQN) method with prioritized experience replay to realize the three-dimensional path planning of an AUV.
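The artificial-potential-field reward design mentioned in the abstract can be sketched as follows: an attractive term that grows as the AUV approaches the goal, and a repulsive penalty that activates within an influence distance of each obstacle. The gains `k_att`, `k_rep`, and threshold `d0` below are illustrative assumptions, not values from the paper.

```python
import math


def potential_field_reward(pos, goal, obstacles,
                           k_att=1.0, k_rep=0.5, d0=5.0):
    """Shape a reward from an artificial potential field (illustrative sketch).

    pos, goal, obstacles: 3D points as (x, y, z) tuples.
    The attractive term is positive and increases as the AUV
    nears the goal; the repulsive term penalizes positions
    within influence distance d0 of any obstacle.
    """
    d_goal = math.dist(pos, goal)
    reward = k_att / (1.0 + d_goal)  # positive, larger nearer the goal
    for obs in obstacles:
        d = math.dist(pos, obs)
        if d < d0:
            # Classic repulsive potential, active only inside d0.
            reward -= k_rep * (1.0 / max(d, 1e-6) - 1.0 / d0)
    return reward
```

Such a shaped reward gives the agent a dense positive signal toward the goal instead of a sparse terminal reward, which is consistent with the paper's stated aim of shortening training time.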

Path Planning Algorithm
HDQN and Prioritized Experience Replay
Prioritized Experience Replay
Set the Rewards and Actions
Field Experiment
Conclusions