Kernel-Based Reinforcement Learning on Representative States

Branislav Kveton,Georgios Theocharous

doi:10.1609/aaai.v26i1.8294

Abstract

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batch-mode reinforcement learning (RL) with continuous state variables. The method is an approximation to kernel-based RL on a set of k representative states. Similarly to kernel-based RL, our solution is a fixed point of a kernelized Bellman operator and can approximate the optimal solution to an arbitrary level of granularity. Unlike kernel-based RL, our method is fast. In particular, our policies can be computed in O(n) time, where n is the number of training examples. The time complexity of kernel-based RL is Ω(n2). We introduce our method, analyze its convergence, and compare it to existing work. The method is evaluated on two existing control problems with 2 to 4 continuous variables and a new problem with 64 variables. In all cases, we outperform state-of-the-art results and offer simpler solutions.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Kernel-Based Reinforcement Learning on Representative States

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Sep 20, 2021
Citations: 15

Similar Papers

Reinforcement Learning for Clinical Applications.
Kia Khezeli ... Benjamin Shickel
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18
Kia Khezeli, et. al.Kia Khezeli ... Benjamin Shickel
08 Feb 2023
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18

Practical kernel-based reinforcement learning
...
Journal of machine learning research : JMLR | VOL. 17
, et. al. ...
01 Jan 2015
Journal of machine learning research : JMLR | VOL. 17

QMDP: DASH Adaptation using Queueing Theory within a Markov Decision Process
Kevin Gatimu ... Ben Lee
-
Kevin Gatimu, et. al.Kevin Gatimu ... Ben Lee
09 Jan 2021
09 Jan 2021

A Reinforcement Learning Method for a Hybrid Flow-Shop Scheduling Problem
Han ... Guo
Algorithms | VOL. 12
Han, et. al. Han ... Guo
23 Oct 2019
Algorithms | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kernel-Based Reinforcement Learning on Representative States

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence