Deep Reinforcement Learning for sim-to-real policy transfer of VTOL-UAVs offshore docking operations

Ali M Ali,Aryaman Gupta,Hashim A Hashim

doi:10.1016/j.asoc.2024.111843

Ali M Ali, Aryaman Gupta + Show 1 more

Open Access

https://doi.org/10.1016/j.asoc.2024.111843

Copy DOI

Export

Save

Cite

Journal: Applied Soft Computing	Publication Date: Jun 12, 2024
Citations: 2	License type: cc-by-nc-nd

Abstract
Full-Text
Similar Papers

Abstract

Listen

This paper proposes a novel Reinforcement Learning (RL) approach for sim-to-real policy transfer of Vertical Take-Off and Landing Unmanned Aerial Vehicle (VTOL-UAV). The proposed approach is designed for VTOL-UAV landing on offshore docking stations in maritime operations. VTOL-UAVs in maritime operations encounter limitations in their operational range, primarily stemming from constraints imposed by their battery capacity. The concept of autonomous landing on a charging platform presents an intriguing prospect for mitigating these limitations by facilitating battery charging and data transfer. However, current Deep Reinforcement Learning (DRL) methods exhibit drawbacks, including lengthy training times, and modest success rates. In this paper, we tackle these concerns comprehensively by decomposing the landing procedure into a sequence of more manageable but analogous tasks in terms of an approach phase and a landing phase. The proposed architecture utilizes a model-based control scheme for the approach phase, where the VTOL-UAV is approaching the offshore docking station. In the Landing phase, DRL agents were trained offline to learn the optimal policy to dock on the offshore station. The Joint North Sea Wave Project (JONSWAP) spectrum model has been employed to create a wave model for each episode, enhancing policy generalization for sim2real transfer. A set of DRL algorithms have been tested through numerical simulations including value-based agents and policy-based agents such as Deep Q Networks (DQN) and Proximal Policy Optimization (PPO) respectively. The numerical experiments show that the PPO agent can learn complicated and efficient policies to land in uncertain environments, which in turn enhances the likelihood of successful sim-to-real transfer.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Deep Reinforcement Learning for sim-to-real policy transfer of VTOL-UAVs offshore docking operations

Abstract

Published Version

Talk to us

Similar Papers

More From: Applied Soft Computing

Lead the way for us

Similar Papers

Assessing generalizability of Deep Reinforcement Learning algorithms for Automated Vulnerability Assessment and Penetration Testing
Andrea Venturi ... Michele Colajanni
Array | VOL. 24
Andrea Venturi, et. al.Andrea Venturi ... Michele Colajanni
27 Sep 2024
Array | VOL. 24

A comparative analysis of reinforcement learning algorithms for earth-observing satellite scheduling
Adam Herrmann ... Hanspeter Schaub
Frontiers in Space Technologies | VOL. 4
Adam Herrmann, et. al.Adam Herrmann ... Hanspeter Schaub
29 Nov 2023
Frontiers in Space Technologies | VOL. 4

A maximum entropy deep reinforcement learning method for sequential well placement optimization using multi-discrete action spaces
Kai Zhang ... Zifeng Sun
Geoenergy Science and Engineering | VOL. 240
Kai Zhang, et. al.Kai Zhang ... Zifeng Sun
06 Jun 2024
Geoenergy Science and Engineering | VOL. 240

Deep Reinforcement Learning for Managing Platelets in a Hospital Blood Bank
Joseph M Farrington ... Martin Utley
Blood | VOL. 142
Joseph M Farrington, et. al.Joseph M Farrington ... Martin Utley
02 Nov 2023
Blood | VOL. 142

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Deep Reinforcement Learning for sim-to-real policy transfer of VTOL-UAVs offshore docking operations

Abstract

Published Version

Talk to us

Similar Papers

More From: Applied Soft Computing