A composite learning method for multi-ship collision avoidance based on reinforcement learning and inverse control

Shuo Xie,Xiumin Chu,Mao Zheng,Chenguang Liu

doi:10.1016/j.neucom.2020.05.089

Abstract

Model-free reinforcement learning methods have potentials in ship collision avoidance under unknown environments. To defect the low efficiency problem of the model-free reinforcement learning, a composite learning method is proposed based on an asynchronous advantage actor-critic (A3C) algorithm, a long short-term memory neural network (LSTM) and Q-learning. The proposed method uses Q-learning for adaptive decisions between a LSTM inverse model-based controller and the model-free A3C policy. Multi-ship collision avoidance simulations are conducted to verify the effectiveness of the model-free A3C method, the proposed inverse model-based method and the composite learning method. The simulation results indicate that the proposed composite learning based ship collision avoidance method outperforms the A3C learning method and a traditional optimization-based method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A composite learning method for multi-ship collision avoidance based on reinforcement learning and inverse control

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Jun 4, 2020
Citations: 42

Similar Papers

Decision-making method for multi-ship collision avoidance based on improved extensive game model
Yijing Tu ... Yong Xiong
Maritime Technology and Research | VOL. 4
Yijing Tu, et. al.Yijing Tu ... Yong Xiong
07 May 2022
Maritime Technology and Research | VOL. 4

Automatic collision avoidance of multiple ships based on deep Q-learning
Haiqing Shen ... Chen Guo
Applied Ocean Research | VOL. 86
Haiqing Shen, et. al.Haiqing Shen ... Chen Guo
14 Mar 2019
Applied Ocean Research | VOL. 86

Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems
Cheng Gao ... Dan Wang
Journal of Building Engineering | VOL. 74
Cheng Gao, et. al.Cheng Gao ... Dan Wang
01 Sep 2023
Journal of Building Engineering | VOL. 74

Collision Avoidance Decision Method for Unmanned Surface Vehicle Based on an Improved Velocity Obstacle Algorithm
Yun Li ... Haiyu Zhang
Journal of Marine Science and Engineering | VOL. 10
Yun Li, et. al.Yun Li ... Haiyu Zhang
29 Jul 2022
Journal of Marine Science and Engineering | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A composite learning method for multi-ship collision avoidance based on reinforcement learning and inverse control

Abstract

Talk to us

Similar Papers

More From: Neurocomputing