UAV path planning based on the improved PPO algorithm

Chenyang Qi, Xiaolu Li, Peiyan Cong, Lei Lei, Chengfu Wu

doi:10.1109/arace56528.2022.00040

Abstract

This paper considers the problem of unmanned aerial vehicle (UAV) path planning. Traditional path planning algorithms suffer from low efficiency and poor adaptability, so this paper uses reinforcement learning to complete the path planning. In the classic proximal policy optimization (PPO) algorithm, samples with large rewards in the experience replay buffer can seriously affect training. This degrades the agent's exploration performance and leads to poor convergence in some path planning tasks. To solve these problems, this paper proposes a frequency decomposition PPO algorithm (FD-PPO) and designs a heuristic reward function for the UAV path planning problem. The FD-PPO algorithm decomposes rewards into multi-dimensional frequency rewards and then calculates the frequency return to efficiently guide the UAV through the path planning task. Simulation results show that the proposed FD-PPO algorithm adapts to complex environments and shows outstanding stability in continuous state and action spaces. The FD-PPO algorithm also achieves better path planning performance than the PPO algorithm.
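For orientation, the baseline that the abstract compares against can be sketched as follows. This is a minimal sketch of the standard PPO clipped surrogate objective only, not the FD-PPO method, whose frequency decomposition is not specified in the abstract. The function name `ppo_clip_objective`, its parameters, and the default `clip_eps=0.2` are illustrative assumptions.

```python
import math


def ppo_clip_objective(new_logps, old_logps, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    All names here are hypothetical; this is the textbook PPO objective,
    not the FD-PPO variant described in the abstract.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        # Probability ratio pi_new(a|s) / pi_old(a|s), computed from log-probs.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic choice between the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

Clipping bounds the ratio but not the advantage itself, so a sample with a very large advantage can still dominate the averaged objective. That appears consistent with the sensitivity to large-reward samples that the abstract attributes to PPO.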

Similar Papers

Using On-line Simulation for Adaptive Path Planning of UAVs
Farzad Kamrani ... Rassul Ayani
01 Oct 2007

Travelling salesman problem for UAV path planning with two parallel optimization algorithms
Jie Chen ... Fang Ye
01 Nov 2017

Survey on computational-intelligence-based UAV path planning
Yijing Zhao ... Yang Liu
Knowledge-Based Systems | VOL. 158
13 Jun 2018

Neighborhood global learning based flower pollination algorithm and its application to unmanned aerial vehicle path planning
Yang Chen ... Yue Xu
Expert Systems with Applications | VOL. 170
24 Dec 2020
