Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Xiaoru Zhao,Zhiwei Hou,Rennong Yang,Liangsheng Zhong

doi:10.3390/drones8010018

Xiaoru Zhao, Zhiwei Hou + Show 2 more

Open Access

https://doi.org/10.3390/drones8010018

Copy DOI

Journal: Drones	Publication Date: Jan 11, 2024
Citations: 3	License type: CC BY 4.0

Affiliation: Air Force Engineering University, Sun Yat-sen University

Abstract

Dedicated to meeting the growing demand for multi-agent collaboration in complex scenarios, this paper introduces a parameter-sharing off-policy multi-agent path planning and the following approach. Current multi-agent path planning predominantly relies on grid-based maps, whereas our proposed approach utilizes laser scan data as input, providing a closer simulation of real-world applications. In this approach, the unmanned aerial vehicle (UAV) uses the soft actor–critic (SAC) algorithm as a planner and trains its policy to converge. This policy enables end-to-end processing of laser scan data, guiding the UAV to avoid obstacles and reach the goal. At the same time, the planner incorporates paths generated by a sampling-based method as following points. The following points are continuously updated as the UAV progresses. Multi-UAV path planning tasks are facilitated, and policy convergence is accelerated through sharing experiences among agents. To address the challenge of UAVs that are initially stationary and overly cautious near the goal, a reward function is designed to encourage UAV movement. Additionally, a multi-UAV simulation environment is established to simulate real-world UAV scenarios to support training and validation of the proposed approach. The simulation results highlight the effectiveness of the presented approach in both the training process and task performance. The presented algorithm achieves an 80% success rate to guarantee that three UAVs reach the goal points.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Drones

Lead the way for us

Similar Papers

Multi-UAV Autonomous Path Planning in Reconnaissance Missions Considering Incomplete Information: A Reinforcement Learning Method
Yu Chen ... Jinyu Wang
Drones | VOL. 7
Yu Chen, et. al.Yu Chen ... Jinyu Wang
23 Dec 2022
Drones | VOL. 7

3M-RL: Multi-Resolution, Multi-Agent, Mean-Field Reinforcement Learning for Autonomous UAV Routing
Weichang Wang ... Rayadurgam Srikant
IEEE Transactions on Intelligent Transportation Systems | VOL. 23
Weichang Wang, et. al.Weichang Wang ... Rayadurgam Srikant
01 Jul 2022
IEEE Transactions on Intelligent Transportation Systems | VOL. 23

A Load-Balanced and Energy-Efficient Navigation Scheme for UAV-Mounted Mobile Edge Computing
Zhenqian Wang ... Huigui Rong
IEEE Transactions on Network Science and Engineering | VOL. 9
Zhenqian Wang, et. al.Zhenqian Wang ... Huigui Rong
01 Sep 2022
IEEE Transactions on Network Science and Engineering | VOL. 9

Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
Harald Bayerlein ... Mirco Theile
IEEE Open Journal of the Communications Society | VOL. 2
Harald Bayerlein, et. al.Harald Bayerlein ... Mirco Theile
01 Jan 2020
IEEE Open Journal of the Communications Society | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Drones