Optimal tracking of continuous-time non-linear systems has been extensively studied in literature. However, in several applications, absence of knowledge about system dynamics poses a severe challenge in solving the optimal tracking problem. This has found growing attention among researchers recently, and integral reinforcement learning based method augmented with actor neural network (NN) have been deployed to this end. However, very few studies have been directed to model-free H ∞ optimal tracking control that helps in attenuating the effect of disturbances on the system performance without any prior knowledge about system dynamics. To this end, a recursive least square-based parameter update was recently proposed. However, gradient descent-based parameter update scheme is more sensitive to real-time variation in plant dynamics. Experience replay (ER) technique has been shown to improve the convergence of NN weights by utilising past observations iteratively. Motivated by these, this study presents a novel parameter update law based on variable gain gradient descent and ER technique for tuning the weights of critic, actor and disturbance NNs. The presented update law leads to improved model-free tracking performance under L 2 -bounded disturbance. Simulation results are presented to validate the presented update law.
Read full abstract