Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

Chengbo Wang,Xinyu Zhang,Musa Bashir,Kwangil Lee,Zaili Yang

doi:10.3389/fmars.2022.1084763

Abstract

Reinforcement learning (RL) has shown superior performance in solving sequential decision problems. In recent years, RL is gradually being used to solve unmanned driving collision avoidance decision-making problems in complex scenarios. However, ships encounter many scenarios, and the differences in scenarios will seriously hinder the application of RL in collision avoidance at sea. Moreover, the iterative speed of trial-and-error learning for RL in multi-ship encounter scenarios is slow. To solve this problem, this study develops a novel intelligent collision avoidance algorithm based on approximate representation reinforcement learning (AR-RL) to realize the collision avoidance of maritime autonomous surface ships (MASS) in a continuous state space environment involving interactive learning capability like a crew in navigation situation. The new algorithm uses an approximate representation model to deal with the optimization of collision avoidance strategies in a dynamic target encounter situation. The model is combined with prior knowledge and International Regulations for Preventing Collisions at Sea (COLREGs) for optimal performance. This is followed by a design of an online solution to a value function approximation model based on gradient descent. This approach can solve the problem of large-scale collision avoidance policy learning in static-dynamic obstacles mixed environment. Finally, algorithm tests were constructed though two scenarios (i.e., the coastal static obstacle environment and the static-dynamic obstacles mixed environment) using Tianjin Port as an example and compared with multiple groups of algorithms. The results show that the algorithm can improve the large-scale learning efficiency of continuous state space of dynamic obstacle environment by approximate representation. At the same time, the MASS can efficiently and safely avoid obstacles enroute to reaching its target destination. It therefore makes significant contributions to ensuring safety at sea in a mixed traffic involving both manned and MASS in near future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Marine Science	Publication Date: Jan 19, 2023
Citations: 36	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

Abstract

Talk to us

Similar Papers

More From: Frontiers in Marine Science

Lead the way for us

Similar Papers

A ship collision avoidance system for human-machine cooperation during collision avoidance
Yamin Huang ... P.H.A.J.M Van Gelder
Ocean Engineering | VOL. 217
Yamin Huang, et. al.Yamin Huang ... P.H.A.J.M Van Gelder
18 Sep 2020
Ocean Engineering | VOL. 217

Navigation Situation Clustering Model of Human-Operated Ships for Maritime Autonomous Surface Ship Collision Avoidance Tests
Taewoong Hwang ... Ik-Hyun Youn
Journal of Marine Science and Engineering | VOL. 9
Taewoong Hwang, et. al.Taewoong Hwang ... Ik-Hyun Youn
20 Dec 2021
Journal of Marine Science and Engineering | VOL. 9

An Approach of Consensus-Based Double-Layer Blockchain System for Multi-Ship Collision Risk Mitigation Considering COLREGs
Yongjun Chen ... Yang Wang
Applied Sciences | VOL. 13
Yongjun Chen, et. al.Yongjun Chen ... Yang Wang
11 Oct 2023
Applied Sciences | VOL. 13

Deep reinforcement learning with dynamic window approach based collision avoidance path planning for maritime autonomous surface ships
Chuanbo Wu ... Weiqiang Liao
Ocean Engineering | VOL. 284
Chuanbo Wu, et. al.Chuanbo Wu ... Weiqiang Liao
01 Jul 2023
Ocean Engineering | VOL. 284

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

Abstract

Talk to us

Similar Papers

More From: Frontiers in Marine Science