Abstract

This paper presents a neuro-dynamic programming methodology for the control of Markov decision processes. The proposed method can be viewed as a variant of optimistic policy iteration, in which radial basis function (RBF) networks serve as a compact representation of the cost-to-go function and λ-LSPE is used for policy evaluation. We also emphasize a reformulation of the Bellman equation around the post-decision state, which circumvents the explicit computation of the expectation. The proposed algorithm is applied to a retailer inventory management problem.
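
To make the post-decision reformulation concrete, here is a minimal sketch in generic MDP notation; the symbols $g$, $f^{\mathrm{d}}$, $f^{\mathrm{s}}$, $\alpha$, and the splitting of the transition into a deterministic decision step and a stochastic disturbance step are illustrative assumptions, not notation taken from the paper. The standard Bellman equation places the expectation inside the minimization:

\[
J(s) \;=\; \min_{a \in A(s)} \mathbb{E}_w\!\left[\, g(s,a,w) \;+\; \alpha\, J\big(f(s,a,w)\big) \,\right].
\]

If the transition factors as $s^a = f^{\mathrm{d}}(s,a)$ (the post-decision state, reached deterministically) followed by $s' = f^{\mathrm{s}}(s^a, w)$ (the random disturbance step), then the cost-to-go of the post-decision state satisfies

\[
J^{\mathrm{post}}(s^a) \;=\; \mathbb{E}_w\!\left[\, \min_{a' \in A(s')} \Big( g(s',a') \;+\; \alpha\, J^{\mathrm{post}}\big(f^{\mathrm{d}}(s',a')\big) \Big) \right],
\qquad s' = f^{\mathrm{s}}(s^a, w).
\]

Under these assumptions the minimization is deterministic given the realized next state $s'$, so it can be carried out along simulated trajectories without averaging over the disturbance; the expectation is handled implicitly by the simulation, which is what makes this form convenient for simulation-based evaluation schemes such as λ-LSPE.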
