Abstract

The survivability of autonomous underwater vehicles (AUVs) in complex missions and dangerous situations is of great significance to ocean resource exploration, hydrological research, maritime rescue, and undersea military applications. Existing research on AUV motion control mainly focuses on normal operation; active self-rescue methods for emergency situations are hardly found. Because classical control methods are insufficient for the AUV's complicated self-rescue missions, this paper applies a deep reinforcement learning (DRL) algorithm, which has advantages in learning and decision making for complex robot control tasks. This paper first explores normal motion control of the AUV based on the deep deterministic policy gradient (DDPG) algorithm, including yaw angle adjustment, yaw angle adjustment extension, trajectory tracking, and normal floating-up control. Active self-rescue methods are then achieved to recover the AUV from emergencies, such as a sharp decrease in ocean water density or one fin getting jammed at a random angle. Moreover, real-environment experiments are successfully conducted on a self-developed AUV platform to validate the feasibility of the proposed control methods. The results can effectively improve the survivability of the AUV and serve as a reference for submarine survivability technologies.

Highlights

  • Autonomous underwater vehicles (AUVs) play an irreplaceable part in ocean resource exploration, hydrological research, underwater military applications, maritime rescue, and other fields

  • This article proposes an attitude-based control strategy to achieve motion control for normal operation, as well as active self-rescue in emergencies, such as a sharp decrease in ocean water density or one fin getting jammed at a random angle

  • Four agents are trained in the simulation environments: Agent 1 accomplishes the yaw angle adjustment task within a small range, Agent 2 deals with the normal floating-up task, Agent 3 carries out the floating-up task when ocean water density decreases sharply, and Agent 4 handles the floating-up task when one fin gets jammed at a random angle


Summary

INTRODUCTION

Autonomous underwater vehicles (AUVs) play an irreplaceable part in ocean resource exploration, hydrological research, underwater military applications, maritime rescue, and other fields. This article proposes an attitude-based control strategy to achieve motion control for normal operation, as well as active self-rescue in emergencies, such as a sharp decrease in ocean water density or one fin getting jammed at a random angle. Research on active self-rescue of AUVs is rare because control is very difficult across the variable emergency types in the hostile underwater environment. Considering the great potential of the DRL algorithm for AUV control, this paper adopts a state-of-the-art DRL algorithm to realize normal motion control and active self-rescue control on a physics simulator and conducts experiments with a self-developed X-rudder AUV. The main contributions are as follows. (1) After the agent model for the yaw angle adjustment task within a small range is trained via the DDPG algorithm, a yaw angle adjustment extension method is adopted to enable the AUV to realize yaw angle adjustment over a large range. (2) Based on the yaw angle adjustment control strategy, a trajectory tracking control strategy is demonstrated by a sinusoidal curve tracking simulation experiment. (3) An active self-rescue method based on the DDPG algorithm is proposed to realize autonomous recovery in emergencies when the seawater density decreases or a fin of the AUV is jammed. (4) A self-developed AUV is used in real experiments to verify the feasibility of the proposed control strategies.
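The yaw angle adjustment extension idea in contribution (1) can be sketched as setpoint decomposition: a large desired yaw change is split into a sequence of intermediate setpoints, each within the small range the trained agent handles reliably. The following minimal Python sketch illustrates this idea; the function name and the 30-degree per-setpoint limit are illustrative assumptions, not values taken from the paper.

```python
import math

def plan_yaw_setpoints(current_yaw, target_yaw, max_step=math.radians(30)):
    """Split a large yaw change into setpoints a small-range agent can track.

    Angles are in radians; max_step is the largest single adjustment the
    trained agent is assumed to handle (hypothetical value).
    """
    # Wrap the total yaw change into (-pi, pi] so the AUV turns the short way.
    delta = (target_yaw - current_yaw + math.pi) % (2 * math.pi) - math.pi
    # Number of equal increments needed so each one stays within max_step.
    n_steps = max(1, math.ceil(abs(delta) / max_step))
    step = delta / n_steps
    # Intermediate setpoints handed to the small-range agent one at a time.
    return [current_yaw + step * (k + 1) for k in range(n_steps)]
```

For example, a 90-degree turn under a 30-degree per-setpoint limit decomposes into three equal setpoints, the last of which equals the target yaw.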

Mathematical model
Deep deterministic policy gradient
Yaw angle adjustment extension method
Reward function
The active self-rescue methods
Fixed yaw angle adjustment task
Sinusoid trajectory tracking task
Normal floating-up task
Floating-up task with ocean density reduction
Floating-up task with Fin 4 jammed
EXPERIMENTS
Navigation with fin jam and no agent control
CONCLUSIONS
