Probabilistic dual heuristic programming-based adaptive critic

Randa Herzallah

doi:10.1080/00207720903045767

Abstract

Adaptive critic (AC) methods have common roots as generalisations of dynamic programming for neural reinforcement learning approaches. Since they approximate the dynamic programming solutions, they are potentially suitable for learning in noisy, non-linear and non-stationary environments. In this study, a novel probabilistic dual heuristic programming (DHP)-based AC controller is proposed. Distinct to current approaches, the proposed probabilistic (DHP) AC method takes uncertainties of forward model and inverse controller into consideration. Therefore, it is suitable for deterministic and stochastic control problems characterised by functional uncertainty. Theoretical development of the proposed method is validated by analytically evaluating the correct value of the cost function which satisfies the Bellman equation in a linear quadratic control problem. The target value of the probabilistic critic network is then calculated and shown to be equal to the analytically derived correct value. Full derivation of the Riccati solution for this non-standard stochastic linear quadratic control problem is also provided. Moreover, the performance of the proposed probabilistic controller is demonstrated on linear and non-linear control examples.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Probabilistic dual heuristic programming-based adaptive critic

Abstract

Talk to us

Similar Papers

More From: International Journal of Systems Science

Lead the way for us

Journal: International Journal of Systems Science	Publication Date: Feb 1, 2010
Citations: 4

Similar Papers

English
...
-
, et. al. ...
01 Jan 2008
01 Jan 2008

Characterization of optimal feedback for stochastic linear quadratic control problems
Qi Lü ... Xu Zhang
Probability, Uncertainty and Quantitative Risk | VOL. 2
Qi Lü, et. al.Qi Lü ... Xu Zhang
27 Sep 2017
Probability, Uncertainty and Quantitative Risk | VOL. 2

On Deterministic and Stochastic Linear Quadratic Control Problems
Tijana Levajković ... Hermann Mena
-
Tijana Levajković, et. al.Tijana Levajković ... Hermann Mena
01 Jan 2015
01 Jan 2015

Numerical solution of the finite horizon stochastic linear quadratic control problem
Tobias Damm ... Tony Stillfjord
Numerical Linear Algebra with Applications | VOL. 24
Tobias Damm, et. al.Tobias Damm ... Tony Stillfjord
17 Mar 2017
Numerical Linear Algebra with Applications | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Probabilistic dual heuristic programming-based adaptive critic

Abstract

Talk to us

Similar Papers

More From: International Journal of Systems Science