Intelligent navigation method for multiple marine autonomous surface ships based on improved PPO algorithm

Zhewen Cui, Wei Guan, Wenzhe Luo, Xianku Zhang

doi:10.1016/j.oceaneng.2023.115783

Abstract

To achieve autonomous navigation of multiple Marine Autonomous Surface Ships (MASS), an intelligent planning and decision-making method based on the Rapidly-exploring Random Trees star (RRT-star) algorithm and an improved Proximal Policy Optimization (PPO) algorithm is proposed. The novelty of the study is twofold: (1) the traditional PPO algorithm is enhanced with Generalized Advantage Estimation (GAE) and a Long Short-Term Memory (LSTM) network, which accelerate convergence of the average reward, predict the state space, and improve the accuracy of the advantage function estimate; (2) a complete reward function is proposed that not only guides MASS towards the waypoint but also ensures that MASS complies with the COLREGs during collision avoidance. Notably, the trained network model generalizes to different scenarios: we attached the trained neural network model to different MASS and simulated it in different open and narrow waters. Even in emergency situations, the method can still allow MASS to deviate from the COLREGs and make flexible collision avoidance decisions. The results show that the method handles multi-MASS encounter situations well and enables MASS to reach the waypoint safely.
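
As a rough illustration of the advantage estimate mentioned in point (1), the sketch below computes GAE for a single trajectory in its standard textbook form. It is not taken from the paper: the function name `gae_advantages`, the default values of `gamma` and `lam`, and the assumption that the trajectory contains no episode termination are all illustrative choices.

```python
def gae_advantages(rewards, values, last_value, gamma=0.99, lam=0.95):
    """Standard GAE for one trajectory (hypothetical helper, not the paper's code).

    rewards[t] : reward received after acting in state t
    values[t]  : critic's value estimate for state t
    last_value : critic's value estimate for the state after the final step
    Assumes no episode termination occurs inside the trajectory.
    """
    advantages = [0.0] * len(rewards)
    next_value = last_value
    running = 0.0
    # Walk backwards so each step reuses the discounted sum of later TD errors.
    for t in reversed(range(len(rewards))):
        delta = rewards[t] + gamma * next_value - values[t]  # one-step TD error
        running = delta + gamma * lam * running
        advantages[t] = running
        next_value = values[t]
    return advantages
```

With `lam=0` the estimate reduces to the one-step TD error, and with `lam=1` it becomes the discounted return minus the value baseline, so `lam` trades bias against variance.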

Journal: Ocean Engineering
Publication Date: Sep 12, 2023
Citations: 6

Similar Papers

Implementing action mask in proximal policy optimization (PPO) algorithm
Cheng-Yen Tang ... Chien-Hung Liu
ICT Express | VOL. 6
20 May 2020

Multiple-UAV Reinforcement Learning Algorithm Based on Improved PPO in Ray Framework
Guang Zhan ... Xinmiao Zhang
Drones | VOL. 6
04 Jul 2022

Research on Behavioral Decision at an Unsignalized Roundabout for Automatic Driving Based on Proximal Policy Optimization Algorithm
Jingpeng Gan ... Jiancheng Zhang
Applied Sciences | VOL. 14
29 Mar 2024

Proximal policy optimization via enhanced exploration efficiency
Junwei Zhang ... Shuai Lü
Information Sciences | VOL. 609
25 Jul 2022