Abstract

Path planning is one of the key technologies for the autonomous flight of unmanned aerial vehicles (UAVs). Traditional path planning algorithms have limitations and deficiencies in complex and dynamic environments. In this article, we propose a deep reinforcement learning approach for three-dimensional path planning that uses only local information and the relative distance to the target, without global information. In practice, a UAV with limited sensor capabilities can observe only its nearby environment, so path planning can be formulated as a Partially Observable Markov Decision Process (POMDP). A recurrent neural network with temporal memory is constructed to address the partial observability by extracting crucial information from historical state-action sequences. We develop an action selection strategy that combines the current reward value with the state-action value to reduce meaningless exploration. In addition, we construct two sample memory pools and propose an adaptive experience replay mechanism based on the frequency of failure. Simulation results show that our method significantly improves on Deep Q-Network and Deep Recurrent Q-Network in terms of stability and learning efficiency. Our approach plans a reasonable three-dimensional path in a large-scale, complex environment and reliably avoids obstacles in unknown environments.
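To make the POMDP formulation above concrete, the following is a minimal sketch of a DRQN-style recurrent Q-network that summarizes the observation history, in the spirit of the recurrent architecture the abstract describes; the class name, layer sizes, and exact structure are illustrative assumptions, not the paper's reported architecture.

```python
import torch
import torch.nn as nn

class RecurrentQNet(nn.Module):
    """LSTM-based Q-network over a history of local observations
    (illustrative sketch; layer sizes and names are assumptions)."""

    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 128):
        super().__init__()
        self.encoder = nn.Linear(obs_dim, hidden)               # embed one observation
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)   # temporal memory
        self.q_head = nn.Linear(hidden, n_actions)              # Q(s, a) per action

    def forward(self, obs_seq, hidden_state=None):
        # obs_seq: (batch, time, obs_dim) -- local sensor readings plus
        # the relative distance to the target at each step.
        x = torch.relu(self.encoder(obs_seq))
        x, hidden_state = self.lstm(x, hidden_state)
        # Q-values at every step; the last step's values drive action selection.
        return self.q_head(x), hidden_state
```

Because the LSTM carries a hidden state across steps, the network conditions its Q-values on the whole state-action history rather than on the current partial observation alone, which is what makes the partially observable setting tractable.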

Highlights

  • The unmanned aerial vehicle (UAV) has attracted wide attention in both military and civilian fields because of its low cost, flexibility, and small size [1], [2]

  • We propose a deep reinforcement learning approach to solve the problem of the UAV path planning in the complex and dynamic environment

  • The main contributions of this article are summarized as follows: 1) We propose a new action selection strategy that combines the current reward value R with the Q value, which addresses the inaccurate predictions of the neural network at the early stage of training (a minimal sketch follows this list)
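As referenced in the last highlight, here is a minimal sketch of an R-and-Q blended action selection rule; the weighting parameter beta and the exact combination formula are assumptions, since the summary does not give the paper's precise rule.

```python
import numpy as np

def select_action(q_values, step_rewards, epsilon, beta=0.5):
    """Blend immediate rewards R(s, a) with predicted Q(s, a)
    (hypothetical weighting; the paper's exact rule may differ).

    q_values     : array of Q-value predictions, one per action
    step_rewards : array of one-step rewards, one per action
    epsilon      : exploration probability
    beta         : weight on the reward term; a larger beta leans on the
                   known immediate reward while early Q estimates are noisy
    """
    if np.random.rand() < epsilon:
        return np.random.randint(len(q_values))  # random exploration
    scores = beta * np.asarray(step_rewards) + (1.0 - beta) * np.asarray(q_values)
    return int(np.argmax(scores))
```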


Summary

INTRODUCTION

The unmanned aerial vehicle (UAV) has attracted wide attention in both military and civilian fields because of its low cost, flexibility, and small size [1], [2]. Combining different methods can exploit the advantages of each algorithm [15]–[17]. Reinforcement learning essentially learns a mapping from state to action and involves no complex search process at decision time, so it is well suited to UAV path planning, which requires real-time decisions. Path planning in a large-scale and dynamic environment poses challenges; among them, the enormous number of states makes the neural network learn slowly and converge with difficulty. We propose a deep reinforcement learning approach to solve the UAV path planning problem in complex and dynamic environments.
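One mechanism the abstract proposes for learning efficiency in this large state space is adaptive experience replay over two sample memory pools keyed to the frequency of failure (see the ADAPTIVE SAMPLING MECHANISM section below). The following is one plausible reading as a minimal sketch; the pool split, the moving-average failure estimate, and the sampling rule are all assumptions rather than the paper's exact design.

```python
import random
from collections import deque

class AdaptiveReplay:
    """Two replay pools sampled in proportion to a running failure rate
    (illustrative sketch; the concrete mechanism is an assumption)."""

    def __init__(self, capacity: int = 50_000):
        self.success_pool = deque(maxlen=capacity)  # transitions from successful episodes
        self.failure_pool = deque(maxlen=capacity)  # transitions from failed episodes
        self.failure_rate = 0.5  # running estimate of how often episodes fail

    def store(self, transition, failed: bool):
        (self.failure_pool if failed else self.success_pool).append(transition)
        # Exponential moving average keeps the failure-frequency estimate current.
        self.failure_rate = 0.99 * self.failure_rate + 0.01 * float(failed)

    def sample(self, batch_size: int):
        # Replay more failure transitions when the agent fails often, so hard
        # cases are revisited; may return fewer than batch_size while the
        # pools are still filling.
        n_fail = min(int(batch_size * self.failure_rate), len(self.failure_pool))
        n_succ = min(batch_size - n_fail, len(self.success_pool))
        return (random.sample(self.failure_pool, n_fail)
                + random.sample(self.success_pool, n_succ))
```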

CONSTRUCTION OF THE ALGORITHM
REWARD DESIGN
IMPROVED ACTION SELECTION STRATEGY
ADAPTIVE SAMPLING MECHANISM
COMPARISON OF ALGORITHM PERFORMANCE IN A STATIC SCENARIO
Findings
CONCLUSION
