Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making

Yueqi Hou,Xiaolong Liang,Maolong Lv,Qisong Yang,Yang Li

doi:10.1016/j.engappai.2023.106703

Abstract

Unmanned Aerial Vehicle (UAV) maneuver strategy learning remains a challenge when using Reinforcement Learning (RL) in this sparse reward task. In this paper, we propose Subtask-Masked curriculum learning for RL (SubMas-RL), an efficient RL paradigm that implements curriculum learning and knowledge transfer for UAV maneuver scenarios involving multiple missiles. First, this study introduces a novel concept known as subtask mask to create source tasks from a target task by masking partial subtasks. Then, a subtask-masked curriculum generation method is proposed to generate a sequenced curriculum by alternately conducting task generation and task sequencing. To establish efficient knowledge transfer and avoid negative transfer, this paper employs two transfer techniques, policy distillation and policy reuse, along with an explicit transfer condition that masks irrelevant knowledge. Experimental results demonstrate that our method achieves a 94.8% success rate in the UAV maneuver scenario, where the direct use of reinforcement learning always fails. The proposed RL framework SubMas-RL is expected to learn an effective policy in complex tasks with sparse rewards.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Engineering Applications of Artificial Intelligence	Publication Date: Jul 6, 2023
Citations: 3	License type: other-oa

R Discovery Prime

R Discovery Prime

Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence

Lead the way for us

Similar Papers

Computational Aeromechanics of a Manuevering Unmanned Aerial Vehicle with Variable-Incidence Wings
V Dwivedi ... M Damodaran
Journal of Aircraft | VOL. 52
V Dwivedi, et. al.V Dwivedi ... M Damodaran
03 Jun 2015
Journal of Aircraft | VOL. 52

Intelligent joint trajectory design and resource allocation in UAV-based data harvesting system
Siyu Luo ... Jienan Chen
-
Siyu Luo, et. al.Siyu Luo ... Jienan Chen
09 Oct 2020
09 Oct 2020

Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms
Marwan Abdou Dhuheir ... Emna Baccour
IEEE Internet of Things Journal | VOL. 10
Marwan Abdou Dhuheir, et. al.Marwan Abdou Dhuheir ... Emna Baccour
01 May 2023
IEEE Internet of Things Journal | VOL. 10

A new consensus theory-based method for formation control and obstacle avoidance of UAVs
Yu Wu ... Yanting Huang
Aerospace Science and Technology | VOL. 107
Yu Wu, et. al.Yu Wu ... Yanting Huang
05 Nov 2020
Aerospace Science and Technology | VOL. 107

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence