Abstract

This paper contributes a unified formulation that merges previous analyses of predicting the performance (value function) of a given sequence of actions (policy) when an agent operates in a Markov decision process with a large state space. When states are represented by features and the value function is approximated linearly, our analysis reveals a new relationship between two common cost functions used to obtain the optimal approximation. In addition, this analysis allows us to propose an efficient adaptive algorithm that provides an unbiased linear estimate. The performance of the proposed algorithm is illustrated by simulation, showing competitive results when compared with state-of-the-art solutions.
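To fix ideas on the setting the abstract describes (feature-based states with a linearly approximated value function), the following is a minimal generic sketch of linear value-function prediction using a TD(0)-style update. It is not the paper's algorithm or its cost-function analysis; the 3-state chain, one-hot features, discount factor, and step size are all illustrative assumptions.

```python
import numpy as np

n_states = 3
# Each state s is represented by a feature vector phi[s];
# one-hot features are used here for simplicity, so the linear
# approximation V(s) ~= phi[s] @ w can represent any value function.
phi = np.eye(n_states)
gamma = 0.9   # discount factor (illustrative)
alpha = 0.1   # step size (illustrative)

# A fixed policy induces a Markov chain: deterministic ring 0->1->2->0,
# with a reward of 1 collected when leaving state 2.
next_state = {0: 1, 1: 2, 2: 0}
reward = {0: 0.0, 1: 0.0, 2: 1.0}

w = np.zeros(n_states)  # weights of the linear value estimate

s = 0
for _ in range(5000):
    s2 = next_state[s]
    # TD(0): move the estimate toward the bootstrapped target
    # r + gamma * V(s2), along the gradient direction phi[s].
    td_error = reward[s] + gamma * phi[s2] @ w - phi[s] @ w
    w += alpha * td_error * phi[s]
    s = s2

V = phi @ w  # estimated state values
print(np.round(V, 3))
```

For this deterministic cycle the true values satisfy V(2) = 1 / (1 - gamma^3) and V(0) = gamma^2 * V(2), so the printed estimates can be checked against roughly [2.989, 3.321, 3.690].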
