Abstract
Path planning is a key process in robotics, playing an important role in fields such as autonomous driving and logistics delivery. Our work addresses the dual challenges of training efficiency and composite optimization in path planning with Deep Reinforcement Learning (DRL). We introduce the Efficient Progressive Policy Enhancement (EPPE) framework, which combines sparse rewards, aimed at achieving a globally optimal policy, with process rewards that provide real-time feedback for the agent's policy adjustment. This framework not only significantly improves policy learning efficiency but also resolves the reward coupling issues introduced by process rewards, thereby ensuring convergence to a globally optimal policy. Within this framework, the initial reward structure incorporates guiding rewards, a type of process reward derived from conventional path planning algorithms, and assigns them large weights to provide real-time feedback, effectively improving training efficiency. In addition, the Incremental Reward Adjustment (IRA) model progressively increases the reward weights of the composite optimization terms, and the Fine-tuning Policy Optimization (FPO) model, which supports IRA, gradually adjusts the learning rate throughout training. Simulation experiments demonstrate the advantage of our framework in composite path optimization. In static obstacle environments, compared with seven benchmark algorithms, the time and distance to reach the target improve by at least 10.4%; in mixed obstacle environments, the improvements are at least 19.1% and 18.2%, respectively. Our framework also significantly improves the training efficiency of DRL.
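For concreteness, the minimal Python sketch below illustrates the kind of reward composition and schedules the abstract describes: a dense guiding (process) reward that dominates early training, a sparse global reward whose weight is raised progressively in the spirit of IRA, and a gradual learning-rate adjustment in the spirit of FPO. All function names, the linear IRA schedule, the exponential FPO decay, and the specific reward forms are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def guiding_reward(state, path_hint):
    """Process reward (hypothetical form): dense feedback based on proximity
    to a waypoint suggested by a conventional path planning algorithm."""
    return -np.linalg.norm(np.asarray(state) - np.asarray(path_hint))

def sparse_reward(reached_goal, collided):
    """Sparse reward: terminal feedback tied to the global objective."""
    if reached_goal:
        return 1.0
    if collided:
        return -1.0
    return 0.0

def ira_weight(step, total_steps, w_init=0.1, w_final=1.0):
    """IRA-style schedule (assumed linear): progressively raise the weight
    of the composite-optimization (sparse/global) term over training."""
    frac = min(step / total_steps, 1.0)
    return w_init + (w_final - w_init) * frac

def composite_reward(state, path_hint, reached_goal, collided, step, total_steps):
    """Blend process and sparse rewards. Early on, the guiding reward
    dominates for fast feedback; later, the sparse global reward dominates,
    decoupling the final policy from the guiding signal."""
    w = ira_weight(step, total_steps)
    return (1.0 - w) * guiding_reward(state, path_hint) \
           + w * sparse_reward(reached_goal, collided)

def fpo_learning_rate(step, total_steps, lr_init=3e-4, lr_final=3e-5):
    """FPO-style schedule (assumed exponential decay): gradually reduce the
    learning rate so the policy fine-tunes as the reward weights shift."""
    frac = min(step / total_steps, 1.0)
    return lr_init * (lr_final / lr_init) ** frac
```

Under this reading, the two schedules work together: as the sparse term's weight grows, a shrinking learning rate keeps the policy's earlier progress from being overwritten while it adapts to the increasingly global objective.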