Robotic arm trajectory tracking method based on improved proximal policy optimization

Qingchun Zheng,Peihao Zhu,Wenpeng Ma,Yangyang Zhao,Zhi Peng

doi:10.59277/pra-ser.a.24.3.05

Abstract

To study the method of trajectory tracking for robotic arms, the traditional tracking method has low accuracy and cannot realize the complex tracking tasks. Compared with traditional methods, deep reinforcement learning is an effective scheme with the advantages of robustness and solving complex problems. This study aims to improve the tracking efficiency of robotic arms based on deep reinforcement learning. Thereby, we propose an approach to improve the proximal policy optimization (Improved-PPO) in this paper, which can be applied to multiple degrees of freedom robotic arms for trajectory tracking. In this study, proximal policy optimization (PPO) and model predictive control (MPC) are integrated to provide an effective algorithm for robotic arm applications. MPC is employed for trajectory prediction to design the controller. Further, the Improved-PPO algorithm is employed for trajectory tracking. The Improved-PPO algorithm is further compared with the asynchronous advantage actor-critic (A3C) and PPO algorithms. The simulation results show that the convergence speed of the Improved-PPO algorithm is increased by 84.3% and 15.4% compared with the A3C and PPO algorithms. This method provides a new research concept for robotic arm trajectory tracking.

Full Text