Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design

Xin Chen,Min Wu,Yong He,Penghuan Xie,Yonghua Xiong

doi:10.1155/2015/760459

Xin Chen, Min Wu + Show 3 more

Open Access

https://doi.org/10.1155/2015/760459

Copy DOI

Abstract

Adaptive Dynamic Programming (ADP) with critic-actor architecture is an effective way to perform online learning control. To avoid the subjectivity in the design of a neural network that serves as a critic network, kernel-based adaptive critic design (ACD) was developed recently. There are two essential issues for a static kernel-based model: how to determine proper hyperparameters in advance and how to select right samples to describe the value function. They all rely on the assessment of sample values. Based on the theoretical analysis, this paper presents a two-phase simultaneous learning method for a Gaussian-kernel-based critic network. It is able to estimate the values of samples without infinitively revisiting them. And the hyperparameters of the kernel model are optimized simultaneously. Based on the estimated sample values, the sample set can be refined by adding alternatives or deleting redundances. Combining this critic design with actor network, we present a Gaussian-kernel-based Adaptive Dynamic Programming (GK-ADP) approach. Simulations are used to verify its feasibility, particularly the necessity of two-phase learning, the convergence characteristics, and the improvement of the system performance by using a varying sample set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematical Problems in Engineering	Publication Date: Jan 1, 2015
Citations: 31	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering

Lead the way for us

Similar Papers

Adaptive Critic Design with Local Gaussian Process Models
Wei Wang ... Jianxin He
Journal of Advanced Computational Intelligence and Intelligent Informatics | VOL. 20
Wei Wang, et. al.Wei Wang ... Jianxin He
20 Dec 2016
Journal of Advanced Computational Intelligence and Intelligent Informatics | VOL. 20

Tracking control of affine nonlinear discrete-time systems based on Gaussian-kernel-based ADP
Xin Chen ... Min Wu
-
Xin Chen, et. al.Xin Chen ... Min Wu
01 Jul 2016
01 Jul 2016

Linear-quadratic optimal control for unknown mean-field stochastic discrete-time system via adaptive dynamic programming approach
Ruirui Liu ... Xikui Liu
Neurocomputing | VOL. 282
Ruirui Liu, et. al.Ruirui Liu ... Xikui Liu
13 Dec 2017
Neurocomputing | VOL. 282

Single network adaptive critic design for power system stabilisers
G Gurrala ... I Sen
IET Generation, Transmission & Distribution | VOL. 3
G Gurrala, et. al.G Gurrala ... I Sen
01 Sep 2009
IET Generation, Transmission & Distribution | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering