Piecewise linear value function approximation for factored MDPs

Pascal Poupart ,Relu Patrascu ,Craig Boutilier ,Dale Schuurmans

doi:10.5555/777092.777140

Abstract

A number of proposals have been put forth in recent years for the solution of Markov decision processes (MDPs) whose state (and sometimes action) spaces are factored. One recent class of methods involves linear value function approximation, where the optimal value function is assumed to be a linear combination of some set of basis functions, with the aim of finding suitable weights. While sophisticated techniques have been developed for finding the best approximation within this constrained space, few methods have been proposed for choosing a suitable basis set, or modifying it if solution quality is found wanting. We propose a general framework, and specific proposals, that address both of these questions. In particular, we examine weakly coupled MDPs where a number of subtasks can be viewed independently modulo resource constraints. We then describe methods for constructing a piecewise linear combination of the subtask value functions, using greedy decision tree techniques. We argue that this architecture is suitable for many types of MDPs whose combinatorics are determined largely by the existence multiple conflicting objectives.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Piecewise linear value function approximation for factored MDPs

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Sigma point policy iteration
...
-
, et. al. ...
12 May 2008
12 May 2008

A Survey of Linear Value Function Approximation in Reinforcement Learning
Shicheng Guo ... Xueyu Wei
-
Shicheng Guo, et. al.Shicheng Guo ... Xueyu Wei
01 Jan 2021
01 Jan 2021

Multi-agent temporal-difference learning with linear function approximation: Weak convergence under time-varying network topologies
Milos S Stankovic ... Srdjan S Stankovic
-
Milos S Stankovic, et. al.Milos S Stankovic ... Srdjan S Stankovic
01 Jul 2016
01 Jul 2016

A unified framework for linear function approximation of value functions in stochastic control
...
-
, et. al. ...
09 Sep 2013
09 Sep 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Piecewise linear value function approximation for factored MDPs

Abstract

Talk to us

Similar Papers