Data-driven approximate value iteration with optimality error bound analysis

Yongqiang Li,Zhongsheng Hou,Yuanjing Feng,Ronghu Chi

doi:10.1016/j.automatica.2016.12.019

Abstract

Features of the data-driven approximate value iteration (AVI) algorithm, proposed in Li et al. (2014) for dealing with the optimal stabilization problem, include that only process data is required and that the estimate of the domain of attraction for the closed-loop is enlarged. However, the controller generated by the data-driven AVI algorithm is an approximate solution for the optimal control problem. In this work, a quantitative analysis result on the error bound between the optimal cost and the cost under the designed controller is given. This error bound is determined by the approximation error of the estimation for the optimal cost and the approximation error of the controller function estimator. The first one is concretely determined by the approximation error of the data-driven dynamic programming (DP) operator to the DP operator and the approximation error of the value function estimator. These three approximation errors are zeros when the data set of the plant is sufficient and infinitely complete, and the number of samples in the interested state space is infinite. This means that the cost under the designed controller equals to the optimal cost when the number of iterations is infinite.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data-driven approximate value iteration with optimality error bound analysis

Abstract

Talk to us

Similar Papers

More From: Automatica

Lead the way for us

Journal: Automatica	Publication Date: Jan 24, 2017
Citations: 16

Similar Papers

A perturbation approach to a class of discounted approximate value iteration algorithms with borel spaces
Joaquín López-Borbón ... Óscar Vega-Amaya
Journal of Dynamics and Games | VOL. 3
Joaquín López-Borbón, et. al.Joaquín López-Borbón ... Óscar Vega-Amaya
01 Aug 2016
Journal of Dynamics and Games | VOL. 3

Approximate dynamic programming via direct search in the space of value function approximations
E.F Arruda ... J.B.R Do Val
European Journal of Operational Research | VOL. 211
E.F Arruda, et. al.E.F Arruda ... J.B.R Do Val
13 Jan 2011
European Journal of Operational Research | VOL. 211

Stability and optimality of a multi-product production and storage system under demand uncertainty
E.F Arruda ... J.B.R Do Val
European Journal of Operational Research | VOL. 188
E.F Arruda, et. al.E.F Arruda ... J.B.R Do Val
01 Jul 2008
European Journal of Operational Research | VOL. 188

Approximate value iteration with randomized policies
D.P De Farias ... B Van Roy
-
D.P De Farias, et. al.D.P De Farias ... B Van Roy
12 Dec 2000
12 Dec 2000

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data-driven approximate value iteration with optimality error bound analysis

Abstract

Talk to us

Similar Papers

More From: Automatica