Path Planning Research of a UAV Base Station Searching for Disaster Victims' Location Information Based on Deep Reinforcement Learning.

Jinduo Zhao,Zhigao Gan,Yilin Li,Chao Wang,Wenjun Li,Ruixue Li,Jiakai Liang,Keqiang Yue

doi:10.3390/e24121767

Jinduo Zhao, Zhigao Gan + Show 6 more

Open Access

https://doi.org/10.3390/e24121767

Copy DOI

Journal: Entropy (Basel, Switzerland)	Publication Date: Dec 2, 2022
Citations: 4	License type: CC BY 4.0

Affiliation: Hangzhou Dianzi University

Abstract

Aiming at the path planning problem of unmanned aerial vehicle (UAV) base stations when performing search tasks, this paper proposes a Double DQN-state splitting Q network (DDQN-SSQN) algorithm that combines state splitting and optimal state to complete the optimal path planning of UAV based on the Deep Reinforcement Learning DDQN algorithm. The method stores multidimensional state information in categories and uses targeted training to obtain optimal path information. The method also references the received signal strength indicator (RSSI) to influence the reward received by the agent, and in this way reduces the decision difficulty of the UAV. In order to simulate the scenarios of UAVs in real work, this paper uses the Open AI Gym simulation platform to construct a mission system model. The simulation results show that the proposed scheme can plan the optimal path faster than other traditional algorithmic schemes and has a greater advantage in the stability and convergence speed of the algorithm.

Full Text