Abstract

Two controllers, a PI and a PID, are tuned through the twin delayed deep deterministic policy gradient or TD3 algorithm, a reinforcement learning technique that can be applied to systems with continuous action states. For configuring the training environments, two mathematical models are identified, one for a temperature loop and the other for a flow loop, obtaining a first-order model and a FOPDT model, respectively. Furthermore, classical tuning methods are used as a reference to evaluate the performance of controllers tuned by means of an RL agent. The results show that, the algorithm tuning a PI controller with good performance, while the PID controller does not achieve outstanding results compared to traditional methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call