Abstract

We consider the implementation of adaptive critic designs using neural networks. The present scheme is within the general framework of approximate dynamic programming where optimal/suboptimal control is achieved through learning using multilayer feedforward neural networks. We will develop a class of adaptive critic designs that can be classified as (model-free) action-dependent heuristic dynamic programming (ADHDP). We believe that the present ADHDP is equivalent to the conventional model-based HDP since the model network in the latter can be viewed as completely embedded in the critic network.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.