Abstract

Autonomous underwater vehicles (AUVs) are widely used for complex underwater tasks such as seafloor exploration. In recent years, deep reinforcement learning (DRL) has been applied to AUV control because of its capability to improve AUV autonomy. However, designing an effective reward function for DRL methods is usually very difficult. Generative adversarial imitation learning (GAIL) allows AUVs to learn control policies from expert demonstrations instead of pre-defined reward functions, but it requires optimal expert demonstrations and cannot surpass the demonstrations it is given. This paper builds upon the GAIL algorithm to let AUVs learn control policies from expert demonstrations. We propose an importance-reweighting generative adversarial imitation learning (WGAIL) algorithm that uses confidence scores to indicate the optimality of demonstrated trajectories, enabling AUVs to learn control policies from expert demonstrations of different quality levels. Experimental results on a simulated AUV system, modeling our lab's Sailfish 210 in the Gazebo simulation environment, show that an AUV trained via WGAIL achieves better performance than one trained via GAIL across different levels of sub-optimal expert demonstrations. Moreover, control policies trained via WGAIL in simple tasks generalize better to complex tasks than those trained via GAIL, greatly extending the applicability of AUV learning from expert demonstrations.
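To illustrate the reweighting idea described above, the following is a minimal sketch, not the paper's exact objective: a GAIL-style binary cross-entropy discriminator loss in which each expert sample's contribution is scaled by a per-trajectory confidence score in [0, 1], so that low-confidence (sub-optimal) demonstrations influence the discriminator less. The function name and normalization scheme are illustrative assumptions.

```python
import numpy as np

def weighted_discriminator_loss(d_expert, d_policy, confidence):
    """Confidence-weighted GAIL discriminator loss (illustrative sketch).

    d_expert:   discriminator outputs D(s, a) in (0, 1) on expert samples
    d_policy:   discriminator outputs D(s, a) in (0, 1) on policy samples
    confidence: per-sample confidence scores in [0, 1], one per expert
                sample, indicating how optimal its source trajectory is
                (assumed weighting scheme, not the paper's exact form)
    """
    d_expert = np.clip(np.asarray(d_expert, dtype=float), 1e-8, 1 - 1e-8)
    d_policy = np.clip(np.asarray(d_policy, dtype=float), 1e-8, 1 - 1e-8)
    confidence = np.asarray(confidence, dtype=float)

    # Expert term: -log D(s, a), reweighted by confidence and normalized
    # by the total confidence mass so weights act like importance weights.
    expert_term = -np.sum(confidence * np.log(d_expert)) / np.sum(confidence)
    # Policy term: standard unweighted -log(1 - D(s, a)).
    policy_term = -np.mean(np.log(1.0 - d_policy))
    return expert_term + policy_term
```

Down-weighting a demonstration's confidence toward zero removes its pull on the discriminator, which is how mixed-quality demonstration sets can be used without letting poor trajectories dominate training.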
