Abstract
This paper addresses the estimation of the maximum expected value of an infinite set of random variables. This estimation problem is relevant in many fields, such as Reinforcement Learning (RL). In RL it is well known that, in some stochastic environments, a bias in the estimation error can accumulate step by step, increasing the approximation error and leading to large overestimates of the true action values. Recently, some approaches have been proposed to reduce this bias and obtain better action-value estimates, but they are limited to finite problems. In this paper, we leverage the recently proposed weighted estimator and Gaussian process regression to derive a new method that natively handles infinitely many random variables. We show how these techniques can be used to address RL problems with both continuous states and continuous actions. To evaluate the effectiveness of the proposed approach, we perform empirical comparisons with related methods.
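To illustrate the general idea behind combining a weighted estimator with Gaussian process regression, the sketch below estimates the maximum expected value of a noisy function over a continuous (here 1-D) action space. This is a minimal illustration, not the paper's exact algorithm: the toy value function, the RBF-plus-noise kernel, the grid discretization, the sample counts, and the use of scikit-learn are all assumptions made for the example. Each grid point is weighted by the empirical posterior probability that it is the maximizer, and the weighted average of posterior means is compared against the (upward-biased) maximum of the posterior means.

```python
# Sketch of a weighted estimator of the maximum expected value over a continuous
# action space, using a Gaussian process posterior (illustrative, not the paper's
# exact method). Kernel, grid, and sample sizes are arbitrary choices.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(0)

# Hypothetical noisy observations of an unknown action-value function.
def true_q(a):
    return np.sin(3.0 * a) + 0.5 * a

actions = rng.uniform(0.0, 2.0, size=(40, 1))
rewards = true_q(actions).ravel() + rng.normal(0.0, 0.3, size=40)

# Fit a GP posterior to the observed (action, reward) pairs.
kernel = RBF(length_scale=0.3) + WhiteKernel(noise_level=0.1)
gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True).fit(actions, rewards)

# Discretize the action space (a finite approximation of the continuous case).
grid = np.linspace(0.0, 2.0, 200).reshape(-1, 1)
mean, _ = gp.predict(grid, return_std=True)

# Weighted estimator: weight each point by the posterior probability that it is
# the maximizer, estimated empirically from posterior function samples.
samples = gp.sample_y(grid, n_samples=1000, random_state=1)   # shape (200, 1000)
argmax_idx = samples.argmax(axis=0)
weights = np.bincount(argmax_idx, minlength=grid.shape[0]) / samples.shape[1]

weighted_max = float(weights @ mean)   # weighted estimate of the max expected value
max_of_means = float(mean.max())       # plain maximum estimator, prone to overestimation
print(f"weighted estimate: {weighted_max:.3f}  max-of-means: {max_of_means:.3f}")
```

In this toy setup, averaging over the posterior probability of being the maximizer typically yields a less optimistic estimate than simply taking the maximum of the posterior means, which mirrors the overestimation issue discussed in the abstract.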