Energy Efficient 3-D UAV Control for Persistent Communication Service and Fairness: A Deep Reinforcement Learning Approach

Hang Qi,Xiangming Wen,Zhaoming Lu,Hao Huang,Zhiqun Hu

doi:10.1109/access.2020.2981403

Open Access

Abstract

Recently, unmanned aerial vehicles (UAVs) serving as flying wireless communication platforms have attracted much attention. Thanks to their mobility, UAV aerial base stations can be deployed quickly and flexibly, and can effectively establish Line-of-Sight communication links. However, UAV communication systems face many challenges. The first is the energy constraint: UAV battery lifetime is on the order of a fraction of an hour. The second is that the coverage area of a UAV aerial base station is limited and commercial UAVs are usually expensive. Thus, covering a large target region at all times with sufficient UAVs is quite challenging. To address these challenges, we propose energy-efficient and fair 3-D UAV scheduling with energy replenishment, in which UAVs move around to serve users and recharge in time to replenish their energy. Inspired by the success of deep reinforcement learning, we propose a UAV Control policy based on Deep Deterministic Policy Gradient (UC-DDPG) to address the combined problem of 3-D mobility of multiple UAVs and energy replenishment scheduling, which ensures energy-efficient and fair coverage of each user in a large region and maintains persistent service. Simulation results show that UC-DDPG converges well and outperforms other scheduling algorithms in terms of data volume, energy efficiency and fairness.

Highlights

The unmanned aerial vehicle (UAV) as a flying wireless communication platform is a promising technology for enhancing wireless networks, thanks to inherent attributes such as mobility, flexibility and adaptive altitude [1].
Unlike existing works that assume either 2-D or stationary UAV coverage, and inspired by the success of deep reinforcement learning (DRL), we propose a UAV Control policy based on the Deep Deterministic Policy Gradient (DDPG) algorithm [17] (UC-DDPG) to address the combined problem of 3-D mobility of multiple UAVs and energy replenishment scheduling. The policy ensures energy-efficient and fair coverage of each ground user in a large target region while maintaining persistent service.
To improve energy efficiency and guarantee service fairness, we develop a 3-D UAV deployment scheduling algorithm based on DDPG that takes the residual energy of each UAV, circuit power, communication power, mobility power and hover power into account.
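
The highlight above lists the quantities the scheduler accounts for: residual energy, circuit power, communication power, mobility power and hover power. The paper's actual energy model is not reproduced on this page, so the following is only a minimal sketch of how such per-slot bookkeeping could look. The class `PowerProfile`, the functions `slot_energy` and `update_residual`, the `moving_fraction` parameter and the additive form of the model are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class PowerProfile:
    """Hypothetical per-UAV power draw in watts (illustrative values only)."""
    circuit_w: float
    communication_w: float
    mobility_w: float
    hover_w: float


def slot_energy(profile: PowerProfile, slot_s: float, moving_fraction: float) -> float:
    """Energy in joules spent during one time slot.

    Assumption: the UAV moves for `moving_fraction` of the slot and hovers
    for the remainder, while circuit and communication power are drawn for
    the whole slot.
    """
    if not 0.0 <= moving_fraction <= 1.0:
        raise ValueError("moving_fraction must be in [0, 1]")
    always_on = profile.circuit_w + profile.communication_w
    flight = (moving_fraction * profile.mobility_w
              + (1.0 - moving_fraction) * profile.hover_w)
    return (always_on + flight) * slot_s


def update_residual(residual_j: float, spent_j: float) -> float:
    """Residual energy after a slot, clipped at zero (battery depleted)."""
    return max(0.0, residual_j - spent_j)
```

In a scheduler of this kind, the residual value would typically be part of the agent's state, so that the policy can decide when a UAV should leave to recharge.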

Summary

INTRODUCTION

The unmanned aerial vehicle (UAV) as a flying wireless communication platform is a promising technology for enhancing wireless networks, thanks to inherent attributes such as mobility, flexibility and adaptive altitude [1]. Unlike existing works that assume either 2-D or stationary UAV coverage, and inspired by the success of DRL, we propose a UAV Control policy based on the Deep Deterministic Policy Gradient (DDPG) algorithm [17] (UC-DDPG) to address the combined problem of 3-D mobility of multiple UAVs and energy replenishment scheduling. The policy ensures energy-efficient and fair coverage of each ground user in a large target region while maintaining persistent service. The work in [10] proposed a framework for energy-efficient uplink data collection from ground IoT devices by jointly optimizing the 3-D placement, device-UAV association and uplink power control in a single time slot.
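
The text stresses fair coverage of each ground user but this page does not state which fairness measure the paper uses. A common choice in the UAV coverage literature is Jain's fairness index, so the sketch below uses it purely as an assumed example. The function name `jain_fairness` and the handling of the all-zero case are illustrative choices, not taken from the paper.

```python
def jain_fairness(values):
    """Jain's fairness index: (sum x)^2 / (n * sum x^2).

    Returns 1.0 when every user receives the same amount and 1/n when a
    single user receives everything. The all-zero case is mathematically
    undefined; returning 0.0 here is an assumption of this sketch.
    """
    values = list(values)
    if not values:
        raise ValueError("values must not be empty")
    total = sum(values)
    sum_sq = sum(v * v for v in values)
    if sum_sq == 0:
        return 0.0
    return (total * total) / (len(values) * sum_sq)
```

For example, passing the per-user data volume accumulated over an episode yields a single score in (0, 1] that a reward function could weigh against energy consumption.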

DATA RATE MODEL

ENERGY MODEL

PROBLEM DEFINITION

PRELIMINARIES ON DDPG

UAV CONTROL BASED ON DDPG

18: Update the target networks:
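
The update rule for this step is cut off in this excerpt. In the standard DDPG formulation, the target actor and critic are moved slowly toward the online networks with a soft update, θ' ← τθ + (1 − τ)θ'. The sketch below illustrates that standard rule on flat lists of weights; whether the paper uses exactly this form and which value of τ it chooses are not shown here, and the name `soft_update` is hypothetical.

```python
def soft_update(target, online, tau):
    """Return new target weights: tau * online + (1 - tau) * target.

    `target` and `online` are equal-length sequences of floats standing in
    for network parameters. A small tau makes the target network track the
    online network slowly.
    """
    if not 0.0 <= tau <= 1.0:
        raise ValueError("tau must be in [0, 1]")
    if len(target) != len(online):
        raise ValueError("parameter lists must have the same length")
    return [tau * w + (1.0 - tau) * w_t for w_t, w in zip(target, online)]
```

The same function would be applied separately to the target actor and the target critic after each gradient step on the online networks.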

SIMULATION AND PERFORMANCE EVALUATION

Findings

CONCLUSION

Journal: IEEE Access
Publication Date: Jan 1, 2020
Citations: 57
License type: CC BY 4.0

Similar Papers

Energy-Efficient UAV Control for Effective and Fair Communication Coverage: A Deep Reinforcement Learning Approach
Chi Harold Liu ... Jian Tang
IEEE Journal on Selected Areas in Communications | VOL. 36
01 Sep 2018

Signaling Protocols for Local Area Networks of Drones
Prabhu Jyot Singh ... Rohan De Silva
International journal of Computer Networks & Communications | VOL. 12
31 May 2020

Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading
Ang Gao ... Qi Wang
IEEE Transactions on Vehicular Technology | VOL. 70
01 Dec 2021

Path Following Control for UAV Using Deep Reinforcement Learning Approach
Yintao Zhang ... Ziquan Yu
Guidance, Navigation and Control | VOL. 01
01 Mar 2021
