DWAS-RL: A safety-efficiency balanced reinforcement learning approach for path planning of Unmanned Surface Vehicles in complex marine environments

Tianci Qu, Gang Xiong, Hub Ali, Xisong Dong, Yunjun Han, Zhen Shen, Fei-Yue Wang

doi:10.1016/j.oceaneng.2024.119641

Journal: Ocean Engineering

Publication Date: Feb 1, 2025

Abstract

Navigating autonomous surface vehicles in dynamic marine environments, where uncertainties and disturbances such as static or moving obstacles, ocean currents, and waves abound, poses a formidable challenge. Recent advances in Deep Reinforcement Learning (DRL) have shown promising adaptivity and timeliness through interaction with the environment. However, achieving zero safety violations while remaining sample-efficient is still a dual challenge in practical applications. In this paper, we aim to ensure both safety and learning efficiency by combining the advantages of the Dynamic Window Approach (DWA) and safe reinforcement learning (RL). First, a customized simulator for diverse marine conditions is developed, in which the algorithms are trained and tested across various types of marine scenarios. Then, the problem is formulated as a constrained Markov decision process, and the DWA-based safe RL (DWAS-RL) approach is proposed. Specifically, to guarantee safety during exploration, we use DWA to observe the environment and generate prudent actions by predicting potential near-future hazards, and then use the safe RL framework for exploration and training. To improve sample efficiency, Hindsight Experience Replay is used to accelerate training. Simulation experiments demonstrate the effectiveness of our approach in terms of kinematic performance, safety, and sample efficiency compared with state-of-the-art DRL algorithms.
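
To illustrate the idea of using DWA-style prediction to screen exploratory actions, the following is a minimal sketch and not the authors' implementation. It assumes a simple unicycle motion model, circular obstacles given as `(x, y, radius)`, and a fixed prediction horizon; the names `rollout`, `is_safe`, and `shield`, and all parameter values, are hypothetical.

```python
import math


def rollout(state, v, w, horizon=2.0, dt=0.1):
    """Forward-simulate an assumed unicycle model from state = (x, y, heading)."""
    x, y, th = state
    points = []
    for _ in range(int(round(horizon / dt))):
        th += w * dt
        x += v * math.cos(th) * dt
        y += v * math.sin(th) * dt
        points.append((x, y))
    return points


def is_safe(state, action, obstacles, margin=0.5):
    """True if the predicted path stays clear of every obstacle (x, y, radius)."""
    v, w = action
    for px, py in rollout(state, v, w):
        for ox, oy, r in obstacles:
            if math.hypot(px - ox, py - oy) <= r + margin:
                return False
    return True


def shield(state, proposed, obstacles, candidates):
    """Keep the agent's proposed (v, w) if it is predicted safe; otherwise
    return the safe candidate closest to it, or stop if none is safe."""
    if is_safe(state, proposed, obstacles):
        return proposed
    safe = [a for a in candidates if is_safe(state, a, obstacles)]
    if not safe:
        return (0.0, 0.0)  # assumed fallback: zero surge and yaw rate
    return min(
        safe,
        key=lambda a: (a[0] - proposed[0]) ** 2 + (a[1] - proposed[1]) ** 2,
    )
```

In this sketch, an RL agent's action would pass through `shield` before being applied, so that only actions predicted to be collision-free over the horizon reach the environment. Currents and waves, which the abstract lists as disturbances, are not modeled here.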