Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces

Ryohei Sawada,Takahiro Majima,Keiji Sato

doi:10.1007/s00773-020-00755-0

Ryohei Sawada, Takahiro Majima + Show 1 more

Open Access

https://doi.org/10.1007/s00773-020-00755-0

Copy DOI

Abstract

This paper presents an automatic collision avoidance algorithm for ships using a deep reinforcement learning (DRL) in continuous action spaces. Obstacle zone by target (OZT) is used to compute an area where a collision will happen in the future based on dynamic information of ships. Agents of DRL detects the approach of multiple ships using a virtual sensor called the grid sensor. Agents learned collision avoidance maneuvering through Imazu problem, which is a scenario set of ship encounter situations. In this study, we propose a new approach for collision avoidance with a longer safe passing distance using DRL. We develop a novel method named inside OZT that expands OZT to improve the consistency of learning. We redesign the network using the long short-term memory (LSTM) cell and carried out training in continuous action spaces to train a model with longer safe distance than the previous study. The bow cross range in collision detection proposed in this paper is effective to COLREGs-compliant collision avoidance. The trained model has passed all scenarios of Imazu problem. The model is also validated by a test scenario which includes more ships than each scenario of Imazu problem.

Highlights

IntroductionThere has been a lot of research and development on automated ships
In recent years, there has been a lot of research and development on automated ships
We show the results for all scenarios of Imazu problem using the two trained models of continuous action spaces and the previous trained models used in the previous study [17]

Summary

Introduction

There has been a lot of research and development on automated ships. It is reported that collision accidents of ships were mainly caused by human errors such as[2]. By supporting human or automating operations, the number of collision accidents can be decreased. Automatic collision avoidance has been studied for a long time, and a number of algorithms have been proposed [3]. In 1980s, Imazu and Koyama utilized a dynamic programming [4,5,6]. In this method, the ship’s speed and heading angle are defined in a discrete action space, and collision avoidance is performed by selecting the optimal action with an evaluation function based on the International Regulations for Preventing Collisions at Sea (COLREGs) and rules of

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Marine Science and Technology	Publication Date: Aug 3, 2020
Citations: 84	License type: open-access

R Discovery Prime

R Discovery Prime

Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Marine Science and Technology

Lead the way for us

Similar Papers

Energy management of hybrid electric bus based on deep reinforcement learning in continuous state and action space
Huachun Tan ... Yuankai Wu
Energy Conversion and Management | VOL. 195
Huachun Tan, et. al.Huachun Tan ... Yuankai Wu
18 May 2019
Energy Conversion and Management | VOL. 195

Deep reinforcement learning in continuous action space for autonomous robotic surgery.
Amin Abbasi Shahkoo ... Ahmad Ali Abin
International journal of computer assisted radiology and surgery | VOL. 18
Amin Abbasi Shahkoo, et. al.Amin Abbasi Shahkoo ... Ahmad Ali Abin
16 Nov 2022
International journal of computer assisted radiology and surgery | VOL. 18

MASAC-based confrontation game method of UAV clusters
尔申王 ... 靖郭
SCIENTIA SINICA Informationis | VOL. 52
尔申王, et. al.尔申王 ... 靖郭
01 Dec 2022
SCIENTIA SINICA Informationis | VOL. 52

Goal-Oriented Obstacle Avoidance with Deep Reinforcement Learning in Continuous Action Space
Reinis Cimurs ... Il Hong Suh
Electronics | VOL. 9
Reinis Cimurs, et. al.Reinis Cimurs ... Il Hong Suh
28 Feb 2020
Electronics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Marine Science and Technology