A novel Q-learning algorithm based on improved whale optimization algorithm for path planning.

Ying Li,Yanyu Geng,Hanyu Wang,Jiahao Fan

doi:10.1371/journal.pone.0279438

Abstract

Q-learning is a classical reinforcement learning algorithm and one of the most important methods of mobile robot path planning without a prior environmental model. Nevertheless, Q-learning is too simple when initializing Q-table and wastes too much time in the exploration process, causing a slow convergence speed. This paper proposes a new Q-learning algorithm called the Paired Whale Optimization Q-learning Algorithm (PWOQLA) which includes four improvements. Firstly, to accelerate the convergence speed of Q-learning, a whale optimization algorithm is used to initialize the values of a Q-table. Before the exploration process, a Q-table which contains previous experience is learned to improve algorithm efficiency. Secondly, to improve the local exploitation capability of the whale optimization algorithm, a paired whale optimization algorithm is proposed in combination with a pairing strategy to speed up the search for prey. Thirdly, to improve the exploration efficiency of Q-learning and reduce the number of useless explorations, a new selective exploration strategy is introduced which considers the relationship between current position and target position. Fourthly, in order to balance the exploration and exploitation capabilities of Q-learning so that it focuses on exploration in the early stage and on exploitation in the later stage, a nonlinear function is designed which changes the value of ε in ε-greedy Q-learning dynamically based on the number of iterations. Comparing the performance of PWOQLA with other path planning algorithms, experimental results demonstrate that PWOQLA achieves a higher level of accuracy and a faster convergence speed than existing counterparts in mobile robot path planning. The code will be released at https://github.com/wanghanyu0526/improveQL.git.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Dec 27, 2022
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A novel Q-learning algorithm based on improved whale optimization algorithm for path planning.

Abstract

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Research Progress of Nature-Inspired Metaheuristic Algorithms in Mobile Robot Path Planning
Yiqi Xu ... Xuan Xu
Electronics | VOL. 12
Yiqi Xu, et. al.Yiqi Xu ... Xuan Xu
29 Jul 2023
Electronics | VOL. 12

Hybrid Whale Optimization with a Firefly Algorithm for Function Optimization and Mobile Robot Path Planning.
Tao Tian ... Qifang Luo
Biomimetics | VOL. 9
Tao Tian, et. al.Tao Tian ... Qifang Luo
08 Jan 2024
Biomimetics | VOL. 9

Enhanced path planning algorithm via hybrid WOA-PSO for differential wheeled mobile robots
Huda Talib Najm ... Ahmed Sabah Al-Araji
Systems Science & Control Engineering | VOL. 12
Huda Talib Najm, et. al.Huda Talib Najm ... Ahmed Sabah Al-Araji
10 Apr 2024
Systems Science & Control Engineering | VOL. 12

Three-dimensional path planning for autonomous underwater vehicles based on a whale optimization algorithm
Zheping Yan ... Jialing Tang
Ocean Engineering | VOL. 250
Zheping Yan, et. al.Zheping Yan ... Jialing Tang
14 Mar 2022
Ocean Engineering | VOL. 250

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A novel Q-learning algorithm based on improved whale optimization algorithm for path planning.

Abstract

Talk to us

Similar Papers

More From: PLOS ONE