A Fast and Robust Algorithm with Reinforcement Learning for Large UAV Cluster Mission Planning

Lei Zuo,Xiaofei Lu,Shan Gao,Yachao Li,Lianghai Li,Ming Li

doi:10.3390/rs14061304

Abstract

Large Unmanned Aerial Vehicle (UAV) clusters, containing hundreds of UAVs, have widely been used in the modern world. Therein, mission planning is the core of large UAV cluster collaborative systems. In this paper, we propose a mission planning method by introducing the Simple Attention Model (SAM) into Dynamic Information Reinforcement Learning (DIRL), named DIRL-SAM. To reduce the computational complexity of the original attention model, we derive the SAM with a lightweight interactive model to rapidly extract high-dimensional features of the cluster information. In DIRL, dynamic training conditions are considered to simulate different mission environments. Meanwhile, the data expansion in DIRL guarantees the convergence of the model in these dynamic environments, which improves the robustness of the algorithm. Finally, the simulation experiment results show that the proposed method can adaptively provide feasible mission planning schemes with second-level solution speed and that it exhibits excellent generalization performance in large-scale cluster planning problems.

Highlights

Unmanned aerial vehicle (UAV) clusters have been widely used to perform various complex missions in military and civil fields, such as plant protection, mobile signal service, load transportation service, target detection, and strike [1–6]
Parameter settings of the contrast optimization algorithms: We compare our method with two effective heuristic optimization algorithms: genetic algorithm (GA) and particle swarm optimization algorithm (PSO) [39,40]
It can prove that the Dynamic Information Reinforcement Learning (DIRL)-Simple Attention Model (SAM) can adaptively allocate UAV groups for each mission in real time and is a practical algorithm to solve large UAV cluster mission planning problems

Summary

Introduction

Unmanned aerial vehicle (UAV) clusters have been widely used to perform various complex missions in military and civil fields, such as plant protection, mobile signal service, load transportation service, target detection, and strike [1–6]. Based on the constructing optimization model and the objective function, choosing special mathematical methods to solve the problem, i.e., gradient descent, dynamic programming algorithm. We address the large UAV cluster collaborative mission planning problem, where the cluster needs to adaptively assign reasonable UAV subgroups for completing many different missions in real-time To this end, a fast and robust method is proposed, named dynamical information reinforcement learning (DIRL) with the simple attention model (SAM). The novel DIRL is proposed by importing mission information in the UAV data during the dynamical training process, i.e., the mission’s requirement constraints, environment influence factor, location, and the weights between different objective functions. The resulting DIRL-SAM method can provide mission planning schemes for different missions in real time with a one-trained model, proving that it is fast and robust.

Mission and UAV Formulation

Illustration of mission planning planning in in aa large large UAV

Objective

Multiple Objective Functions of Mission

Constraint Conditions of Mission Planning

DIRL-SAM

Encoder

Process

DIRL unsupervised thedynamic dynamic

Experimental Settings

Simulation

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Remote Sensing	Publication Date: Mar 8, 2022
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Fast and Robust Algorithm with Reinforcement Learning for Large UAV Cluster Mission Planning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Remote Sensing

Lead the way for us

Similar Papers

GLIDE: Multi-Agent Deep Reinforcement Learning for Coordinated UAV Control in Dynamic Military Environments
Divija Swetha Gadiraju ... Prasenjit Karmakar
Information | VOL. 15
Divija Swetha Gadiraju, et. al.Divija Swetha Gadiraju ... Prasenjit Karmakar
11 Aug 2024
Information | VOL. 15

Effective Cooperative UAV Searching Using Adaptive STGM Mobility Model in a FANET
Xianfeng Li ... Jianfeng Li
-
Xianfeng Li, et. al.Xianfeng Li ... Jianfeng Li
01 Dec 2018
01 Dec 2018

Design and Implementation of a mission planner for Mutiple UCAVs in a SEAD mission
Bram Vandermeersch ... Michiel Selier
-
Bram Vandermeersch, et. al.Bram Vandermeersch ... Michiel Selier
19 Jun 2005
19 Jun 2005

UAV cooperative search in dynamic environment based on hybrid-layered APF
Rui Shao ... Yuhao Yang
EURASIP Journal on Advances in Signal Processing | VOL. 2021
Rui Shao, et. al.Rui Shao ... Yuhao Yang
23 Oct 2021
EURASIP Journal on Advances in Signal Processing | VOL. 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Fast and Robust Algorithm with Reinforcement Learning for Large UAV Cluster Mission Planning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Remote Sensing