Abstract
Unmanned aerial vehicles (UAVs) are regarded as an effective technology for future wireless networks. However, due to the non-convexity of the joint trajectory design and power allocation (JTDPA) problem, it is challenging to attain the optimal joint policy in multi-UAV networks. In this article, a multi-agent deep reinforcement learning-based approach is presented to maximize the long-term network utility while satisfying the user equipments' quality of service (QoS) requirements. Moreover, since the utility of each UAV depends on both the network environment and the actions of the other UAVs, the JTDPA problem is modeled as a stochastic game. To cope with the high computational complexity caused by the continuous action space and the large state space, a multi-agent deep deterministic policy gradient (MADDPG) method is proposed to obtain the optimal policy for the JTDPA problem. Numerical results indicate that the proposed method achieves higher network utility and system capacity than other optimization methods in multi-UAV networks, with lower computational complexity.
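As a rough illustration of the MADDPG structure described above, the sketch below pairs a decentralized actor (mapping each UAV's local observation to a continuous trajectory/power action) with a centralized critic that scores the joint observations and actions of all UAVs during training. The network sizes, observation and action dimensions, and the PyTorch implementation are illustrative assumptions, not details taken from the paper.

```python
# Minimal MADDPG sketch (assumption: illustrative only, not the authors' exact
# architecture). Each UAV agent has a decentralized actor; a centralized critic
# sees the joint observation-action of all agents during training.
import torch
import torch.nn as nn

class Actor(nn.Module):
    def __init__(self, obs_dim: int, act_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, act_dim), nn.Tanh(),  # bounded continuous actions
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)

class CentralCritic(nn.Module):
    """Q(o_1..o_N, a_1..a_N): evaluates all agents' observations and actions."""
    def __init__(self, n_agents: int, obs_dim: int, act_dim: int):
        super().__init__()
        joint_dim = n_agents * (obs_dim + act_dim)
        self.net = nn.Sequential(
            nn.Linear(joint_dim, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
            nn.Linear(256, 1),
        )

    def forward(self, joint_obs: torch.Tensor, joint_act: torch.Tensor) -> torch.Tensor:
        return self.net(torch.cat([joint_obs, joint_act], dim=-1))

# Example: 3 UAV agents, 10-dim local observation, 4-dim continuous action
# (e.g., 3D velocity plus transmit power) -- dimensions are assumptions.
n_agents, obs_dim, act_dim = 3, 10, 4
actors = [Actor(obs_dim, act_dim) for _ in range(n_agents)]
critics = [CentralCritic(n_agents, obs_dim, act_dim) for _ in range(n_agents)]

obs = torch.randn(n_agents, obs_dim)                      # local observations
acts = torch.stack([a(o) for a, o in zip(actors, obs)])   # decentralized acting
joint_obs = obs.flatten().unsqueeze(0)
joint_act = acts.flatten().unsqueeze(0)
q_values = [c(joint_obs, joint_act) for c in critics]     # centralized critique
```

At execution time each UAV only needs its own actor and local observation, which is what keeps the per-agent decision cost low despite the centralized training step.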
Highlights
Unmanned aerial vehicles (UAVs) have been regarded as an important technology for future wireless networks [1]
Simulation results indicate that the multi-agent deep deterministic policy gradient (MADDPG) scheme can improve system capacity and network utility by over 15% with lower computational cost in multi-UAV networks, compared with other learning-based optimization approaches
In multi-UAV networks, to ensure that all user equipments (UEs) meet the quality of service (QoS) requirements from their connected UAVs, the SINR φi,m(t) of UE m should be no less than the minimum QoS threshold φ_min, i.e., φi,m(t) ≥ φ_min
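The snippet below is a minimal sketch of how such a per-UE QoS feasibility check could be expressed, assuming the received powers have already been computed from some channel model; the noise power, the threshold value phi_min, and the function name are illustrative placeholders rather than quantities from the paper.

```python
# Hedged sketch of the SINR-based QoS check: values and names are assumptions.
import numpy as np

def sinr(p_rx_serving: np.ndarray, p_rx_all: np.ndarray, noise: float) -> np.ndarray:
    """SINR of each UE: serving power over interference-plus-noise.

    p_rx_serving: (M,) received power from the associated UAV.
    p_rx_all:     (M,) total received power from all UAVs.
    """
    interference = p_rx_all - p_rx_serving
    return p_rx_serving / (interference + noise)

# Illustrative received powers in watts; phi_min is an assumed QoS threshold.
p_serv = np.array([1.0e-7, 5.0e-8, 8.0e-8])
p_all = np.array([1.2e-7, 9.0e-8, 1.0e-7])
noise = 1e-10
phi_min = 2.0
qos_satisfied = sinr(p_serv, p_all, noise) >= phi_min   # boolean per UE
```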
Summary
Unmanned aerial vehicles (UAVs) have been regarded as an important technology for future wireless networks [1]. Trajectory design, power allocation, and interference management should be studied jointly in multi-UAV networks. In this work, we propose a reinforcement learning (RL) method to tackle the JTDPA optimization problem in multi-UAV networks. Our previous work proposed a DRL approach for trajectory design and power allocation in UAV networks [22]. Most of these centralized methods, however, may incur expensive computational complexity. In our previous work [24], a multi-agent dueling double deep Q-network method was investigated to tackle the joint user association and resource allocation problem. Here, a multi-agent DRL (MADRL) method is introduced to tackle the JTDPA optimization problem in multi-UAV networks. The utility of each UAV is defined as the profit from the sum rate over its M served UEs minus the cost of its transmit power, where ρi represents the profit per unit rate and λp is the cost of the UAV's transmit power
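A minimal sketch of a utility of this form (revenue proportional to the served sum rate minus a transmit-power cost) is given below; the function name, coefficient values, and rates are hypothetical and only illustrate how ρi and λp enter the trade-off.

```python
# Hedged sketch of a per-UAV utility: rho_i * sum rate - lambda_p * power.
# The exact functional form and numbers are assumptions for illustration.
import numpy as np

def uav_utility(rates_bps: np.ndarray, tx_power_w: float,
                rho_i: float, lambda_p: float) -> float:
    """Profit from the served UE rates minus the transmit-power cost."""
    return rho_i * rates_bps.sum() - lambda_p * tx_power_w

# Example: one UAV serving 3 UEs (rates in bit/s, power in watts)
rates = np.array([2.0e6, 1.5e6, 3.0e6])
utility = uav_utility(rates, tx_power_w=0.5, rho_i=1e-6, lambda_p=0.2)
```

Under this form, raising transmit power increases the achievable rates but also the λp-weighted cost, which is exactly the trade-off the JTDPA policy has to balance.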