Nonasymptotic sequential tests for overlapping hypotheses applied to near-optimal arm identification in bandit models

Aurélien Garivier, Emilie Kaufmann

doi:10.1080/07474946.2021.1847965

Abstract

In this article, we study sequential testing problems with overlapping hypotheses. We first focus on the simple problem of assessing if the mean μ of a Gaussian distribution is smaller or larger than a fixed ε > 0; if μ ∈ (−ε, ε), both answers are considered to be correct. Then, we consider probably approximately correct best arm identification in a bandit model: given K probability distributions on ℝ with means μ_1, …, μ_K, we derive the asymptotic complexity of identifying, with risk at most δ, an index I ∈ {1, …, K} such that μ_I ≥ max_i μ_i − ε. We provide nonasymptotic bounds on the error of a parallel general likelihood ratio test, which can also be used for more general testing problems. We further propose lower bounds on the number of observations needed to identify a correct hypothesis. These lower bounds rely on information-theoretic arguments, and specifically on two versions of a change of measure lemma (a high-level form and a low-level form) whose relative merits are discussed.
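
To make the first problem in the abstract concrete, here is a minimal Python sketch of a sequential test with overlapping hypotheses. It is not the parallel likelihood ratio test analyzed in the article; it swaps in a simpler stopping rule based on a union-bound confidence interval. It assumes a tolerance ε > 0 (the hypothetical parameter `epsilon`) such that the answer "larger" is correct when μ > −ε and the answer "smaller" is correct when μ < ε, so both answers are accepted when μ lies in (−ε, ε). The function names, the known standard deviation `sigma`, and the threshold formula are all assumptions made for illustration.

```python
import math
import random
from typing import Callable, Tuple


def confidence_width(t: int, delta: float, sigma: float = 1.0) -> float:
    """Half-width of an anytime confidence interval for a Gaussian mean.

    Assumed threshold: a union bound over t >= 1 with per-step risk
    delta / (t * (t + 1)), which sums to delta over all steps.
    """
    return sigma * math.sqrt(2.0 * math.log(2.0 * t * (t + 1) / delta) / t)


def overlapping_mean_test(
    sample: Callable[[], float],
    epsilon: float,
    delta: float,
    sigma: float = 1.0,
    max_steps: int = 1_000_000,
) -> Tuple[str, int]:
    """Draw samples until one of the two overlapping answers is certified.

    Returns ("larger", t) once the lower confidence bound exceeds -epsilon,
    or ("smaller", t) once the upper confidence bound falls below +epsilon.
    """
    total = 0.0
    for t in range(1, max_steps + 1):
        total += sample()
        mean = total / t
        width = confidence_width(t, delta, sigma)
        if mean - width > -epsilon:
            return "larger", t   # certifies mu > -epsilon
        if mean + width < epsilon:
            return "smaller", t  # certifies mu < +epsilon
    raise RuntimeError("no decision within max_steps")


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical example: true mean 0.5, unit variance.
    decision, steps = overlapping_mean_test(
        lambda: rng.gauss(0.5, 1.0), epsilon=0.1, delta=0.05
    )
    print(decision, steps)
```

Because the two acceptance regions overlap, this rule stops at the latest when the interval half-width drops below ε, even if μ = 0. With disjoint hypotheses and no tolerance, a mean sitting exactly on the boundary could keep such a test running indefinitely.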

Journal: Sequential Analysis
Publication Date: Jan 15, 2021
Citations: 7
