Abstract

Mobile robotics has a wide range of applications, and path planning is key to realizing them. Mobile robots need to explore their environment autonomously to reach their destinations. The Deep Deterministic Policy Gradient (DDPG) algorithm, a classical deep reinforcement learning algorithm, is well suited to continuous control problems. However, DDPG suffers from low training efficiency and slow convergence because, without policy action filtering, a high proportion of the explored actions are illegal. In this paper, we propose a mobile robot path planning method based on an improved DDPG reinforcement learning algorithm. The method uses a small amount of a priori knowledge to accelerate the training of deep reinforcement learning and reduce the number of trial-and-error episodes, and it adopts an adaptive exploration method based on the $\varepsilon$-greedy algorithm that dynamically adjusts the exploration factor to allocate the probabilities of exploration and exploitation appropriately. The adaptive exploration method improves exploration efficiency, shortens the exploration phase, and speeds up the convergence of the algorithm. Simulation experiments are conducted in a grid environment, and the results show that the proposed algorithm can successfully find the optimal path. Moreover, comparison experiments with Q-learning and SARSA demonstrate that the proposed algorithm achieves better path planning performance, requires the least computation time, and converges fastest.

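The adaptive exploration idea described above can be illustrated with a minimal sketch of an $\varepsilon$-greedy selector whose exploration factor is adjusted during training. The abstract does not give the exact update rule, so the decay schedule, the use of a recent success rate as the adjustment signal, and all names (AdaptiveEpsilonGreedy, eps_min, decay) are illustrative assumptions rather than the authors' implementation.

import random

class AdaptiveEpsilonGreedy:
    """Epsilon-greedy action selection with an adaptively decayed exploration factor.

    Assumed schedule: exponential decay toward eps_min, decaying faster as the
    agent's recent success rate rises, so probability mass shifts from
    exploration to exploitation over training.
    """

    def __init__(self, eps_start=1.0, eps_min=0.05, decay=0.995):
        self.eps = eps_start
        self.eps_min = eps_min
        self.decay = decay

    def update(self, recent_success_rate):
        # Scale the decay by the recent success rate (in [0, 1]):
        # the more reliably the robot reaches the goal, the faster eps shrinks.
        factor = self.decay * (1.0 - 0.5 * recent_success_rate)
        self.eps = max(self.eps_min, self.eps * max(factor, 0.5))

    def choose(self, greedy_action, action_space):
        # With probability eps pick a random action (explore);
        # otherwise exploit the policy's current greedy action.
        if random.random() < self.eps:
            return random.choice(action_space)
        return greedy_action

As a usage sketch, the selector would wrap the policy's action choice each step, and update() would be called once per episode with the fraction of recent episodes in which the goal was reached.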