Abstract

Unmanned surface vehicles (USVs) have been widely used in research and exploration, patrol, and defense. Autonomous navigation and obstacle avoidance, as essential technologies of USVs, are key conditions for successful mission execution. However, the fine-grained modeling required by conventional algorithms cannot deliver real-time, precise behavior control for USVs in complex environments, which poses a great challenge to autonomous control policy design. In this paper, a deep reinforcement learning-based UANOA (USV autonomous navigation and obstacle avoidance) method is proposed. UANOA accomplishes the autonomous navigation task by sensing partial information about the complex surrounding ocean environment in real time and outputting rudder angle control commands in real time. In our work, we employ a double Q-network to achieve end-to-end control from raw sensor input to discrete rudder actions, and design a set of reward functions adapted to USV navigation and obstacle avoidance. To alleviate the decision bias caused by the partial observability of the USV's surroundings, we use long short-term memory (LSTM) networks to enhance the USV's ability to remember the ocean environment. Experiments demonstrate that UANOA enables a USV to reach the target points with near-optimal paths in complex ocean environments without any collisions, and that UANOA outperforms the deep Q-network (DQN) and a random control policy in convergence speed, sailing distance, rudder steering consumption, and other performance measurements.
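A minimal sketch (not the authors' code) of how the pieces named in the abstract could fit together, assuming a PyTorch implementation: an LSTM-based Q-network maps a short history of raw sensor observations to Q-values over discrete rudder angles, and the training target follows double Q-learning, where the online network selects the next action and the target network evaluates it. The observation size, number of rudder angles, layer widths, and discount factor below are illustrative assumptions, not the paper's settings.

```python
import torch
import torch.nn as nn

OBS_DIM = 16             # assumed size of the raw sensor observation vector
NUM_RUDDER_ACTIONS = 5   # assumed discretization of the rudder angle
GAMMA = 0.99             # assumed discount factor

class RecurrentQNet(nn.Module):
    """LSTM over an observation sequence, followed by a Q-value head."""
    def __init__(self, obs_dim=OBS_DIM, hidden=64, n_actions=NUM_RUDDER_ACTIONS):
        super().__init__()
        self.lstm = nn.LSTM(obs_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_actions)

    def forward(self, obs_seq):
        # obs_seq: (batch, seq_len, obs_dim); use the last hidden state
        out, _ = self.lstm(obs_seq)
        return self.head(out[:, -1, :])   # (batch, n_actions)

def double_q_target(online_net, target_net, reward, next_obs_seq, done):
    """Double Q-learning target: online net selects, target net evaluates."""
    with torch.no_grad():
        next_actions = online_net(next_obs_seq).argmax(dim=1, keepdim=True)
        next_q = target_net(next_obs_seq).gather(1, next_actions).squeeze(1)
        return reward + GAMMA * (1.0 - done) * next_q

# Example forward pass on a dummy batch of observation histories.
online, target = RecurrentQNet(), RecurrentQNet()
target.load_state_dict(online.state_dict())
batch = torch.randn(8, 10, OBS_DIM)   # 8 sequences of 10 observations each
td_target = double_q_target(online, target, torch.zeros(8), batch, torch.zeros(8))
print(td_target.shape)                # torch.Size([8])
```

In this kind of setup, the recurrent hidden state is what mitigates partial observability: the Q-values depend on the remembered observation sequence rather than only the latest sensor reading.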

Highlights

  • Unmanned surface vehicles (USVs) are primarily used to perform tasks that are dangerous and unsuitable for manned vessels

  • We model the task with a Markov Decision Process (MDP), a framework typically used for time-series complex decision tasks; we then evaluate the advantages of deep reinforcement learning with double Q-learning, and combine the USV's observation space and control to derive the UANOA algorithm

  • We use an MDP to model the USV navigation and obstacle avoidance task. The proposed UANOA algorithm is built on this MDP framework, and an optimal strategy π is eventually learned by the UANOA algorithm to achieve autonomous navigation of the USV (a hedged sketch of one possible reward design follows this list)
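As a concrete illustration of the reward-shaping idea behind such an MDP, the sketch below combines a progress-toward-target term with a collision penalty, a goal bonus, and a small penalty on rudder changes. The terms and coefficients are assumptions made here for illustration only; the paper's actual reward functions may differ.

```python
def step_reward(prev_dist, dist, rudder_change, collided, reached,
                w_progress=1.0, w_rudder=0.05,
                goal_bonus=10.0, collision_penalty=-10.0):
    """One plausible per-step reward for USV navigation and obstacle avoidance."""
    if collided:
        return collision_penalty          # terminal penalty for any collision
    if reached:
        return goal_bonus                 # terminal bonus for reaching the target
    progress = prev_dist - dist           # positive when moving toward the goal
    return w_progress * progress - w_rudder * abs(rudder_change)

# Example: the USV closes 2 m on the target while steering 5 degrees.
print(step_reward(prev_dist=100.0, dist=98.0, rudder_change=5.0,
                  collided=False, reached=False))   # 1.75
```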

Introduction

USVs are primarily used to perform tasks that are dangerous and unsuitable for manned vessels. When a USV is equipped with a variety of customized sensors, communication devices, and other equipment, it gains greater flexibility and intelligence to perform a variety of complex maritime tasks [1, 2]. Combining USVs with other unmanned systems builds rich clusters of unmanned systems in the ocean, capable of handling more complex maritime missions [3, 4]. USVs encounter different marine environments in different mission scenarios and often fail in their missions due to the harsh marine environment. Therefore, the autonomous navigation and obstacle avoidance capabilities of USVs are highly required. That is to say, under certain constraints, the USV must depart from the initial location and adjust its navigation route in real time according to changes in the external environment in order to reach the final destination.
