Abstract
We introduce and empirically evaluate two novel online gradient-based reinforcement learning algorithms with function approximation --- one model-based, the other model-free. These algorithms allow non-squared loss functions, which is novel in reinforcement learning and appears to confer empirical advantages. We further extend a previous gradient-based algorithm to the full-control setting by using generalized policy iteration. Theoretical properties of these algorithms are studied in a companion paper.