Abstract

Autonomous motion planning (AMP) of unmanned aerial vehicles (UAVs) aims to enable a UAV to fly safely to its target without human intervention. Recently, several emerging deep reinforcement learning (DRL) methods have been employed to address the AMP problem in simplified environments and have yielded good results. This paper proposes a multiple experience pools (MEPs) framework that leverages human expert experiences to speed up the DRL learning process. Based on the deep deterministic policy gradient (DDPG) algorithm, an MEP–DDPG algorithm was designed that uses model predictive control and simulated annealing to generate expert experiences. When this algorithm was applied to a complex, unknown simulation environment constructed from the parameters of a real UAV, training experiments showed that the new DRL algorithm improved performance by more than 20% over the state-of-the-art DDPG. Testing results indicate that UAVs trained with MEP–DDPG can stably complete a variety of tasks in complex, unknown environments.
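The paper's implementation is not reproduced here, but the MEP idea can be illustrated with a small sketch. Below is a minimal, hypothetical Python replay buffer with two pools: one for expert transitions (e.g., produced offline by MPC with simulated annealing) and one for the agent's own transitions. The class name, the fixed mixing ratio, and the sampling scheme are illustrative assumptions, not details taken from the paper:

    import random
    from collections import deque

    class MultiExperiencePool:
        """Hypothetical two-pool replay buffer: one pool holds expert
        transitions (e.g., generated offline by MPC with simulated
        annealing), the other holds the agent's own transitions. Mixing
        the two lets early DDPG updates bootstrap from expert data."""

        def __init__(self, capacity=100000, expert_ratio=0.3):
            self.expert_pool = deque(maxlen=capacity)  # filled once, offline
            self.agent_pool = deque(maxlen=capacity)   # filled during training
            self.expert_ratio = expert_ratio           # assumed mixing fraction

        def add_expert(self, transition):
            self.expert_pool.append(transition)

        def add_agent(self, transition):
            self.agent_pool.append(transition)

        def sample(self, batch_size):
            # Draw a fixed fraction from the expert pool and the rest from
            # the agent pool; fall back to the agent pool if experts run short.
            n_expert = min(int(batch_size * self.expert_ratio), len(self.expert_pool))
            n_agent = batch_size - n_expert
            batch = random.sample(list(self.expert_pool), n_expert)
            batch += random.sample(list(self.agent_pool), min(n_agent, len(self.agent_pool)))
            return batch

In an MEP–DDPG-style training loop, each gradient step would draw a batch via sample() and feed it to the standard DDPG critic and actor updates; annealing expert_ratio toward zero as the agent pool matures is one plausible variant.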

Highlights

  • The number of applications for unmanned aerial vehicles (UAVs) is increasing rapidly in the civil arena, including surveillance [1,2], delivery of goods [3,4], power line inspection [5,6], and mapping [7,8]. Many traditional path planning algorithms, such as the A* algorithm, the visibility graph algorithm, and the free space algorithm, have been used to solve the UAV motion planning problem, but these methods usually achieve good results only when the environment or map is known

  • In the majority of these applications, it is necessary for UAVs to plan their motion such that they can perform their tasks while avoiding threats in complex, unknown environments

  • Simultaneous localization and mapping (SLAM) maps the unknown environment from the UAV's position and sensor information as it moves through the environment, so that the UAV's motion can then be planned automatically based on the drawn map

Summary

Introduction

The number of applications for unmanned aerial vehicles (UAVs) is increasing rapidly in the civil arena, including surveillance [1,2], delivery of goods [3,4], power line inspection [5,6], and mapping [7,8]. Many studies have used DRL to solve the autonomous motion planning (AMP) problem of UAVs and have achieved good results, but these studies still have some shortcomings: (1) the models of the UAV and the environment could be more complex and realistic; and (2) the convergence speed and converged performance of the algorithms could be improved. To address these problems, explorations and experiments were conducted in this study.

Related Work
UAV's AMP
Motion Planning Framework for UAVs
RL for UAV's AMP
MEP–DDPG for Motion Planning
MEP–DDPG Framework
MPC-SA for Expert Experiences (sketched below)
Model Predictive Control
MEP–DDPG Algorithm
Training and Testing Environment
Training in Static Environments
Testing for Tasks with Sudden Threats
Testing for Tasks with Moving Target
Conclusions and Future Work
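The outline entry "MPC-SA for Expert Experiences" (together with the abstract) indicates that expert transitions are generated by model predictive control combined with simulated annealing. The following is a rough sketch only: the toy point-mass dynamics, the cost function, and every parameter (horizon, iteration count, temperature schedule) are assumptions made for illustration, not the paper's actual model or settings:

    import math
    import random

    def rollout_cost(state, actions, step, target):
        """Roll a toy point-mass model forward and accumulate distance to
        the target; this stands in for the paper's UAV model."""
        s, cost = state, 0.0
        for a in actions:
            s = step(s, a)
            cost += math.dist(s, target)
        return cost

    def sa_plan(state, step, target, horizon=10, iters=500, t0=1.0, alpha=0.995):
        """Simulated annealing over a fixed-horizon action sequence, used
        MPC-style: optimize the sequence, execute only the first action."""
        actions = [(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(horizon)]
        cost = rollout_cost(state, actions, step, target)
        temp = t0
        for _ in range(iters):
            cand = list(actions)
            i = random.randrange(horizon)  # perturb one action in the sequence
            cand[i] = (random.uniform(-1, 1), random.uniform(-1, 1))
            c = rollout_cost(state, cand, step, target)
            # Always accept improvements; accept worse candidates with
            # Boltzmann probability so the search can escape local minima.
            if c < cost or random.random() < math.exp((cost - c) / temp):
                actions, cost = cand, c
            temp *= alpha  # geometric cooling schedule
        return actions[0]

    # Toy usage: a 2-D point-mass step function and one planning call.
    step = lambda s, a: (s[0] + 0.1 * a[0], s[1] + 0.1 * a[1])
    first_action = sa_plan(state=(0.0, 0.0), step=step, target=(5.0, 5.0))

Each planned action, once executed in the simulator, would yield a (state, action, reward, next_state) transition to store in the expert pool of the MEP buffer sketched earlier.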