Path planning of UAV base station based on deep reinforcement learning

Siming Yang, Zheng Shan, Jiang Cao, Yuan Gao, Yang Guo, Ping Wang, Xiaonan Wang, Jing Wang, Tingting Zhang, Jiayu Guo

doi:10.1016/j.procs.2022.04.013

Open Access

https://doi.org/10.1016/j.procs.2022.04.013

Journal: Procedia Computer Science	Publication Date: Jan 1, 2022
Citations: 6	License type: cc-by-nc-nd

Affiliation: PLA Academy of Military Science

Abstract

UAV base station platforms have become a research hotspot for assisting ground base stations in providing wireless coverage. At present, the most important issue is how to plan paths that provide a stable communication guarantee for multiple mobile users. In this article, we model the air-to-ground channel to describe the path loss between the UAV platform and the user, and we build a simulation environment for training based on the OpenAI Gym architecture. In addition, this paper proposes a reinforcement learning algorithm based on intrinsic rewards, which uses the mean square error of the state prediction results to quantify the novelty of a state. The algorithm enables agents to carry out strategy iterations efficiently. Experimental results show that our algorithm achieves a higher score and takes less time.
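
The idea of scoring novelty by the mean square error of a state prediction can be illustrated with a small sketch. The abstract does not specify the predictor, so the linear forward model, the class name `ForwardModelCuriosity`, the learning rate, and the weighting factor `beta` below are all assumptions made for illustration and are not the authors' implementation. A transition that the model predicts poorly yields a large error and therefore a large intrinsic reward; as the model is trained on that transition, the reward shrinks.

```python
import numpy as np


class ForwardModelCuriosity:
    """Hypothetical linear forward model (an assumption, not the paper's network).

    Predicts the next state from the current state and action; the mean square
    error of that prediction is used as the intrinsic (novelty) reward.
    """

    def __init__(self, state_dim, action_dim, lr=0.01, seed=0):
        rng = np.random.default_rng(seed)
        # Weight matrix mapping [state, action] to the predicted next state.
        self.W = rng.normal(scale=0.1, size=(state_dim + action_dim, state_dim))
        self.lr = lr

    def _error(self, state, action, next_state):
        x = np.concatenate([state, action])
        return x, x @ self.W - next_state

    def intrinsic_reward(self, state, action, next_state):
        # Mean square prediction error: high for unfamiliar transitions.
        _, error = self._error(state, action, next_state)
        return float(np.mean(error ** 2))

    def update(self, state, action, next_state):
        # One gradient-descent step on the mean square error.
        x, error = self._error(state, action, next_state)
        self.W -= self.lr * (2.0 / error.size) * np.outer(x, error)


def total_reward(extrinsic, intrinsic, beta=0.1):
    # beta is an assumed weighting between task reward and novelty bonus.
    return extrinsic + beta * intrinsic


# Example transition with made-up values: a 2-D position and a 1-D action.
model = ForwardModelCuriosity(state_dim=2, action_dim=1)
s = np.array([1.0, 2.0])
a = np.array([0.5])
s_next = np.array([1.5, 2.0])

reward_before = model.intrinsic_reward(s, a, s_next)
for _ in range(50):
    model.update(s, a, s_next)
reward_after = model.intrinsic_reward(s, a, s_next)
```

In this sketch `reward_after` is smaller than `reward_before`, which reflects the intended behavior: states the agent has already visited many times stop being rewarded as novel, so exploration is pushed toward unfamiliar regions.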

Full Text