Gaussian process bandits with adaptive discretization

Shubhanshu Shekhar,Tara Javidi

doi:10.1214/18-ejs1497

Abstract

In this paper, the problem of maximizing a black-box function $f:\mathcal{X}\to \mathbb{R}$ is studied in the Bayesian framework with a Gaussian Process prior. In particular, a new algorithm for this problem is proposed, and high probability bounds on its simple and cumulative regret are established. The query point selection rule in most existing methods involves an exhaustive search over an increasingly fine sequence of uniform discretizations of $\mathcal{X}$. The proposed algorithm, in contrast, adaptively refines $\mathcal{X}$ which leads to a lower computational complexity, particularly when $\mathcal{X}$ is a subset of a high dimensional Euclidean space. In addition to the computational gains, sufficient conditions are identified under which the regret bounds of the new algorithm improve upon the known results. Finally, an extension of the algorithm to the case of contextual bandits is proposed, and high probability bounds on the contextual regret are presented.

Highlights

We consider the problem of maximizing a function f : X → R from its noisy observations of the form yt = f + ηt, t = 1, 2, . . . , n, (1.1)where ηt is the observation noise at time t
We address two issues with existing approaches to the Gaussian Process (GP) bandits problem: 1. As discussed above, all the existing GP bandit algorithms which minimize the cumulative regret require solving an auxiliary optimization problem over the entire search space for selecting a query point which may be computationally infeasible, and practical implementations resort to various approximation techniques which do not come with theoretical guarantees
We extend our algorithm for GP bandits to the contextual GP bandits and obtain bounds on the contextual regret

Summary

Introduction

We further assume that the function f is expensive to evaluate, and we are allocated a budget of n function evaluations. This problem can be thought of as an extension of the multi-armed bandit (MAB) problem to the case of infinite (possibly uncountable) arms indexed by the set X. The goal is to design a strategy of sequentially selecting query points xt ∈ X based on the past observations {(xi, yi); 1 ≤ i ≤ t − 1} and the prior on f. As in the case of MAB with finitely many arms, the performance of any query point selection strategy is usually measured by the cumulative regret Rn: n

Objectives

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Journal of Statistics	Publication Date: Jan 1, 2018
Citations: 26	License type: cc-by

R Discovery Prime

R Discovery Prime

Gaussian process bandits with adaptive discretization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Statistics

Lead the way for us

Similar Papers

Bayesian function optimization with adaptive discretization
Shubhanshu Shekhar ... Tara Javidi
-
Shubhanshu Shekhar, et. al.Shubhanshu Shekhar ... Tara Javidi
01 Oct 2017
01 Oct 2017

On the Necessary and Sufficient Conditions of a Meaningful Distance Function for High Dimensional Data Space
Chih-Ming Hsu ... Ming-Syan Chen
-
Chih-Ming Hsu, et. al.Chih-Ming Hsu ... Ming-Syan Chen
20 Apr 2006
20 Apr 2006

Adaptive Discretization using Voronoi Trees for Continuous POMDPs
Marcus Hoerger ... Nan Ye
The International Journal of Robotics Research | VOL. 43
Marcus Hoerger, et. al.Marcus Hoerger ... Nan Ye
08 Aug 2023
The International Journal of Robotics Research | VOL. 43

Bayesian framework for least-squares support vector machine classifiers, gaussian processes, and kernel Fisher discriminant analysis.
T Van Gestel ... J A K Suykens
Neural Computation | VOL. 14
T Van Gestel, et. al.T Van Gestel ... J A K Suykens
01 May 2002
Neural Computation | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Gaussian process bandits with adaptive discretization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Statistics