Abstract

This paper deals with reinforcement learning for process modeling and control using a model-free, action-dependent adaptive critic (ADAC). A new modified recursive Levenberg-Marquardt (RLM) training algorithm, called temporal-difference RLM, is developed to improve ADAC performance. Novel application results for a simulated continuous stirred-tank reactor (CSTR) process are included to show the superiority of the new algorithm over conventional temporal-difference stochastic backpropagation.
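To make the terminology concrete, the sketch below illustrates the general idea of an action-dependent critic, i.e. a value function Q(x, u) of both state and action, trained with temporal-difference targets. It is only an assumption-laden illustration: the linear feature map, toy plant, fixed policy, and plain gradient TD step are hypothetical placeholders, not the paper's TD-RLM algorithm or its CSTR case study, whose contribution is to replace the gradient step with a recursive Levenberg-Marquardt update.

```python
import numpy as np

# Illustrative sketch only: a linear action-dependent critic Q(x, u) = w^T phi(x, u)
# trained with a plain temporal-difference (TD) gradient step. The feature map,
# plant, and policy below are hypothetical, not taken from the paper.

def phi(x, u):
    """Hypothetical feature vector for scalar state x and action u."""
    return np.array([1.0, x, u, x * u, x**2, u**2])

def td_update(w, x, u, reward, x_next, u_next, gamma=0.95, lr=0.05):
    """One TD(0) step on the critic weights w for the transition (x, u) -> x_next."""
    td_error = reward + gamma * w @ phi(x_next, u_next) - w @ phi(x, u)
    return w + lr * td_error * phi(x, u), td_error

# Toy usage on a scalar regulation task (not the CSTR process from the paper).
w = np.zeros(6)
x = 1.0
for _ in range(200):
    u = -0.5 * x                    # simple fixed policy, for illustration only
    x_next = 0.9 * x + 0.1 * u      # hypothetical linear plant dynamics
    reward = -(x**2 + 0.1 * u**2)   # negated quadratic regulation cost
    u_next = -0.5 * x_next
    w, _ = td_update(w, x, u, reward, x_next, u_next)
    x = x_next
```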
