Swarm Cooperative Navigation Using Centralized Training and Decentralized Execution

Rana Azzam,Yahya Zweiri,Igor Boiko

doi:10.3390/drones7030193

Rana Azzam, Yahya Zweiri + Show 1 more

Open Access

https://doi.org/10.3390/drones7030193

Copy DOI

Journal: Drones	Publication Date: Mar 11, 2023
Citations: 4	License type: CC BY 4.0

Affiliation: Khalifa University of Science and Technology

Abstract

The demand for autonomous UAV swarm operations has been on the rise following the success of UAVs in various challenging tasks. Yet conventional swarm control approaches are inadequate for coping with swarm scalability, computational requirements, and real-time performance. In this paper, we demonstrate the capability of emerging multi-agent reinforcement learning (MARL) approaches to successfully and efficiently make sequential decisions during UAV swarm collaborative tasks. We propose a scalable, real-time, MARL approach for UAV collaborative navigation where members of the swarm have to arrive at target locations at the same time. Centralized training and decentralized execution (CTDE) are used to achieve this, where a combination of negative and positive reinforcement is employed in the reward function. Curriculum learning is used to facilitate the sought performance, especially due to the high complexity of the problem which requires extensive exploration. A UAV model that highly resembles the respective physical platform is used for training the proposed framework to make training and testing realistic. The scalability of the platform to various swarm sizes, speeds, goal positions, environment dimensions, and UAV masses has been showcased in (1) a load drop-off scenario, and (2) UAV swarm formation without requiring any re-training or fine-tuning of the agents. The obtained simulation results have proven the effectiveness and generalizability of our proposed MARL framework for cooperative UAV navigation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Swarm Cooperative Navigation Using Centralized Training and Decentralized Execution

Abstract

Talk to us

Similar Papers

More From: Drones

Lead the way for us

Similar Papers

Engineering A Large-Scale Traffic Signal Control: A Multi-Agent Reinforcement Learning Approach
Yue Chen ... Hehe Zhang
-
Yue Chen, et. al.Yue Chen ... Hehe Zhang
10 May 2021
10 May 2021

On the Role of Reward Functions for Reinforcement Learning in the Traffic Assignment Problem
Ricardo Grunitzki ... Gabriel De Oliveira Ramos
-
Ricardo Grunitzki, et. al.Ricardo Grunitzki ... Gabriel De Oliveira Ramos
01 Jul 2020
01 Jul 2020

Review of the progress of communication-based multi-agent reinforcement learning
涵王 ... 扬俞
SCIENTIA SINICA Informationis | VOL. 52
涵王, et. al.涵王 ... 扬俞
01 May 2022
SCIENTIA SINICA Informationis | VOL. 52

Decentralised Multi-Agent Reinforcement Learning Approach for the Same-Day Delivery Problem
Elvin Ngu ... Panagiotis Angeloudis
Transportation Research Record: Journal of the Transportation Research Board | VOL. 2676
Elvin Ngu, et. al.Elvin Ngu ... Panagiotis Angeloudis
23 Jun 2022
Transportation Research Record: Journal of the Transportation Research Board | VOL. 2676

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Swarm Cooperative Navigation Using Centralized Training and Decentralized Execution

Abstract

Talk to us

Similar Papers

More From: Drones