Abstract

Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay and heuristic knowledge. In this method, a neural network is used to resolve the “curse of dimensionality” issue of the Q-table in reinforcement learning. As the robot walks in an unknown environment, it collects experience data that is used to train the neural network; this process is called experience replay. Heuristic knowledge helps the robot avoid blind exploration and provides more effective data for training the network. Simulation results show that, compared with existing methods, our method converges to an optimal action strategy in less time and explores a path in an unknown environment with fewer steps and a larger average reward.
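The abstract does not specify the network architecture or the exact form of the heuristic knowledge, so the following is only a minimal sketch of deep Q-learning with experience replay under stated assumptions: the names QNetwork, ReplayBuffer, select_action, and train_step are hypothetical, a distance-to-goal rule stands in for the paper's heuristic knowledge, and a four-move grid action set and all hyperparameters are illustrative, not the authors' implementation.

```python
# Sketch of deep Q-learning with experience replay and a heuristic
# exploration rule, for a grid-world robot. All components below are
# assumptions for illustration, not the paper's exact method.
import random
from collections import deque

import torch
import torch.nn as nn
import torch.nn.functional as F

MOVES = [(0, 1), (1, 0), (0, -1), (-1, 0)]  # assumed action set: N, E, S, W


class QNetwork(nn.Module):
    """Approximates Q(s, a), replacing the tabular Q-table that would
    otherwise grow exponentially with the state dimension."""
    def __init__(self, state_dim, n_actions, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, state):
        return self.net(state)


class ReplayBuffer:
    """Stores (s, a, r, s', done) transitions collected as the robot walks."""
    def __init__(self, capacity=10_000):
        self.buffer = deque(maxlen=capacity)

    def push(self, transition):
        self.buffer.append(transition)

    def sample(self, batch_size):
        batch = random.sample(self.buffer, batch_size)
        s, a, r, s2, done = zip(*batch)
        return (torch.tensor(s, dtype=torch.float32),
                torch.tensor(a, dtype=torch.int64),
                torch.tensor(r, dtype=torch.float32),
                torch.tensor(s2, dtype=torch.float32),
                torch.tensor(done, dtype=torch.float32))


def select_action(q_net, state, pos, goal, epsilon=0.1):
    """Epsilon-greedy choice; on exploration steps, a distance-to-goal
    heuristic (an assumed form of the paper's heuristic knowledge) replaces
    a blind uniform-random move."""
    if random.random() < epsilon:
        return min(range(len(MOVES)),
                   key=lambda i: abs(pos[0] + MOVES[i][0] - goal[0])
                                 + abs(pos[1] + MOVES[i][1] - goal[1]))
    with torch.no_grad():
        return int(q_net(torch.tensor(state, dtype=torch.float32)).argmax())


def train_step(q_net, buffer, optimizer, batch_size=32, gamma=0.99):
    """One experience-replay update: sample past transitions and fit the
    network toward the one-step TD target r + gamma * max_a' Q(s', a')."""
    if len(buffer.buffer) < batch_size:
        return
    s, a, r, s2, done = buffer.sample(batch_size)
    q = q_net(s).gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        target = r + gamma * (1 - done) * q_net(s2).max(dim=1).values
    loss = F.mse_loss(q, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

In a training loop, each step the robot takes would push one transition into the buffer and then call train_step, so the network learns from a random batch of replayed past experience rather than only from its most recent move; the heuristic in select_action is what keeps those replayed transitions biased toward goal-directed, informative data.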
