Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method

Guang Liao, Jian Wang, Dujia Yang, Junan Yang

doi:10.3390/s24216859

Abstract

The multi-UAV target search problem is a central problem in autonomous Unmanned Aerial Vehicle (UAV) decision-making. Multi-Agent Reinforcement Learning (MARL) has become integral to research on multi-UAV target search because it adapts well to the rapid online decision-making that UAVs require in complex, uncertain environments. In non-cooperative target search scenarios, targets may have the ability to escape. Many studies use target probability maps to characterize the likelihood that a target is present, guiding UAVs to explore the task area efficiently and locate the target more quickly. However, the target's escape behavior causes the target probability map to deviate from the target's actual position, which makes the map a less reliable measure of the target's probability of existence and lowers the efficiency of the UAV search. This paper investigates the multi-UAV target search problem in scenarios involving static obstacles and dynamic escape targets, modeling the problem as a decentralized partially observable Markov decision process. Based on this model, a spatio-temporal efficient exploration network and a global convolutional local ascent mechanism are proposed. Building on these, we introduce a multi-UAV Escape Target Search algorithm based on MAPPO (ETS–MAPPO) to address the difficulty of searching for escaping targets. Simulation results demonstrate that the ETS–MAPPO algorithm outperforms five classic MARL algorithms in terms of the number of target searches, area coverage rate, and other metrics.
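
The role of a target probability map can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a grid in which each cell holds an independent probability that a target is present, and a hypothetical sensor with detection probability `p_detect` and false-alarm probability `p_false`. The function name and all parameter values are illustrative assumptions.

```python
import numpy as np

def update_probability_map(prob_map, cell, detected, p_detect=0.9, p_false=0.1):
    """Bayesian update of one grid cell after a single sensor reading.

    Assumption: each cell stores an independent probability that a target
    is present; p_detect and p_false are hypothetical sensor parameters.
    """
    prior = prob_map[cell]
    if detected:
        like_target, like_empty = p_detect, p_false
    else:
        like_target, like_empty = 1.0 - p_detect, 1.0 - p_false
    # Bayes' rule: P(target | reading)
    posterior = like_target * prior / (
        like_target * prior + like_empty * (1.0 - prior)
    )
    updated = prob_map.copy()  # leave the caller's map untouched
    updated[cell] = posterior
    return updated

# Uniform prior: nothing is known about the 5x5 task area yet.
prob_map = np.full((5, 5), 0.5)

# A UAV scans cell (2, 2): once with no detection, once with a detection.
after_miss = update_probability_map(prob_map, (2, 2), detected=False)
after_hit = update_probability_map(prob_map, (2, 2), detected=True)

print(after_miss[2, 2])  # probability drops after a miss
print(after_hit[2, 2])   # probability rises after a detection
```

In this sketch, a cell that was scanned without a detection keeps its low value until it is scanned again. If a target later escapes into that cell, the map still marks it as unlikely, which is the kind of deviation between map and actual target position that the abstract describes.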

Journal: Sensors
Publication Date: Oct 25, 2024
License type: CC BY 4.0

Similar Papers

Lessons learned in single-agent and multiagent learning with robot foraging
Z Ren ... A.B Williams
10 Nov 2003

Three-Dimensional Trajectory and Resource Allocation Optimization in Multi-Unmanned Aerial Vehicle Multicast System: A Multi-Agent Reinforcement Learning Method
Dongyu Wang ... Hongda Yu
Drones | VOL. 7
19 Oct 2023

Joint optimization of communication and mission performance for multi-UAV collaboration network: A multi-agent reinforcement learning method
Yuan He ... Xijian Luo
Ad Hoc Networks | VOL. 164
24 Jul 2024

Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
Anna Dai ... Honggang Zhang
21 Oct 2020
