Abstract
This paper presents a new Q-learning-based approach to mobile robot path planning in complex unknown static environments. As a computational approach to learning through interaction with the environment, reinforcement learning algorithms have been widely used for intelligent robot control, especially for autonomous mobile robots. However, the learning process is slow and cumbersome, and practical applications require rapid convergence. To address the slow convergence and long learning time of Q-learning-based mobile robot path planning, a state-chain sequential feedback Q-learning algorithm is proposed for quickly finding the optimal path of a mobile robot in complex unknown static environments. The state chain is built during the search process. After an action is chosen and the reward is received, the Q-values of the state-action pairs on the previously built state chain are sequentially updated with the one-step Q-learning rule. Because more Q-values are updated after each action, the number of steps (state transitions) required for convergence decreases, and the learning time decreases accordingly. Extensive simulations validate the efficiency of the proposed approach for mobile robot path planning in complex environments. The results show that the new approach converges quickly and that the robot finds the collision-free optimal path in complex unknown static environments in much less time than with the one-step Q-learning algorithm and the Q(λ)-learning algorithm.
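To illustrate the mechanism described above, the following is a minimal sketch of a tabular state-chain sequential update, not the authors' implementation. It assumes a hypothetical environment interface (`env.reset()`, `env.step(action)`, `env.actions`) and an ε-greedy policy; the chain traversal order and all parameter names are illustrative assumptions.

```python
import random
from collections import defaultdict

def state_chain_q_learning(env, episodes=500, alpha=0.1, gamma=0.95, epsilon=0.1):
    """Sketch of state-chain sequential feedback Q-learning (tabular).

    Assumes a hypothetical `env` with reset() -> state,
    step(action) -> (next_state, reward, done), and a list env.actions
    of discrete actions; these names are illustrative, not from the paper.
    """
    Q = defaultdict(float)  # Q[(state, action)] -> estimated value

    for _ in range(episodes):
        state = env.reset()
        chain = []           # state chain built during the search
        done = False

        while not done:
            # epsilon-greedy action selection
            if random.random() < epsilon:
                action = random.choice(env.actions)
            else:
                action = max(env.actions, key=lambda a: Q[(state, a)])

            next_state, reward, done = env.step(action)
            chain.append((state, action, reward, next_state))

            # After one action, sequentially re-apply the one-step Q-learning
            # update to every state-action pair on the previously built chain
            # (most recent first here; the traversal order is an assumption),
            # so a single environment step refreshes many Q-values.
            for s, a, r, s_next in reversed(chain):
                best_next = max(Q[(s_next, b)] for b in env.actions)
                Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

            state = next_state

    return Q
```

Under these assumptions, each environment step costs one pass over the chain, which is the intended trade-off: more Q-value updates per action in exchange for fewer actual steps to convergence.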