Safe reward‐based deep reinforcement learning control for an electro‐hydraulic servo system

Minling Wu,Zhen Yu,Lijun Liu,Weizhou Li

doi:10.1002/rnc.6235

Abstract

AbstractIn this article, a safe deep reinforcement learning (DRL) control method based on a safe reward shaping method is proposed and applied to the constrained control for an electro‐hydraulic servo system (EHSS). The proposed control method improves the safety of the constrained control for a nonlinear system with the minimal intervention to the optimization of the performance objective, while the convergence speed of the DRL process has accelerated. By introducing control barrier functions (CBFs) to the reward shaping, a CBF‐based potential difference term is designed to shape the safe reward, which not only provides the safe guidance for the DRL process by encoding the safety constraints of the nonlinear system, but also considers effects of the complex safety transformation on the convergence process in the DRL. Then the safe reward‐based DRL control method is presented to learn the optimal safety policy of position tracking for the EHSS with position error constraints by planning and optimizing the safety together with the performance objective. Theoretical analysis is given to demonstrate that the proposed control method with the safe reward can achieve the optimal safety performance for the constrained control system. Experimental results of the constrained control for the EHSS with system uncertainties and perturbations are also exhibited, to show that the proposed control method converges fast and performs safer and better than the conventional control methods.

Full Text