Abstract

Traditional reinforcement learning methods suffer from value-function overestimation and partial observability in robot motion planning, especially in UAV obstacle avoidance, leading to long training times and poor convergence during network training. This paper proposes a UAV obstacle-avoidance algorithm based on a deep recurrent double Q-network. By replacing the single-network structure with a dual-network structure, optimal action selection is decoupled from action-value estimation, reducing overestimation of the value function. A GRU recurrent module is introduced into the fully connected layers to process information along the time dimension, enhancing the network's ability to exploit temporal context and improving the algorithm's performance in partially observable environments. On this basis, a prioritized experience replay mechanism is incorporated to accelerate network convergence. Finally, the original and improved algorithms are tested in a simulation environment; the experimental results show that the improved algorithm performs better in terms of training time, obstacle-avoidance success rate, and robustness.
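The approach combines three well-known ingredients: a double Q-network in which one network selects the greedy action while a second evaluates it, a GRU layer that summarizes the observation history, and prioritized experience replay. The following PyTorch sketch of the first two components is illustrative only, not the authors' implementation; the class name GRUQNetwork, the layer sizes, and the tensor shapes are assumptions.

    import torch
    import torch.nn as nn

    class GRUQNetwork(nn.Module):
        """Q-network with a GRU layer: the recurrent state summarizes past
        observations, mitigating partial observability."""
        def __init__(self, obs_dim, n_actions, hidden_dim=64):
            super().__init__()
            self.encoder = nn.Linear(obs_dim, hidden_dim)
            self.gru = nn.GRU(hidden_dim, hidden_dim, batch_first=True)
            self.head = nn.Linear(hidden_dim, n_actions)

        def forward(self, obs_seq, h0=None):
            # obs_seq: (batch, seq_len, obs_dim)
            x = torch.relu(self.encoder(obs_seq))
            out, h = self.gru(x, h0)
            return self.head(out[:, -1]), h  # Q-values at the last time step

    def double_q_target(online_net, target_net, next_obs_seq, reward, done,
                        gamma=0.99):
        # Double-Q decoupling: the online network selects the greedy action,
        # the target network evaluates it, which curbs overestimation.
        with torch.no_grad():
            q_online, _ = online_net(next_obs_seq)
            greedy = q_online.argmax(dim=1, keepdim=True)       # selection
            q_target, _ = target_net(next_obs_seq)
            q_next = q_target.gather(1, greedy).squeeze(1)      # evaluation
            return reward + gamma * (1.0 - done) * q_next

Prioritized experience replay, the third ingredient, would then sample stored transitions with probability proportional to the magnitude of their TD error (the gap between this target and the current Q-estimate); the sketch above omits the replay buffer itself.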
