TARGETED SEQUENTIAL DESIGN FOR TARGETED LEARNING INFERENCE OF THE OPTIMAL TREATMENT RULE AND ITS MEAN REWARD.

Antoine Chambaz, Wenjing Zheng, Mark J. van der Laan

doi:10.1214/16-aos1534

Open Access

https://doi.org/10.1214/16-aos1534

Journal: Annals of Statistics
Publication Date: Dec 1, 2017
Citations: 17
License type: unspecified-oa

Affiliation: University of California, Berkeley

Abstract

This article studies the targeted sequential inference of an optimal treatment rule (TR) and its mean reward in the non-exceptional case, i.e., assuming that there is no stratum of the baseline covariates where treatment is neither beneficial nor harmful, and under a companion margin assumption. Our pivotal estimator, whose definition hinges on the targeted minimum loss estimation (TMLE) principle, actually infers the mean reward under the current estimate of the optimal TR. This data-adaptive statistical parameter is of interest in its own right. Our main result is a central limit theorem which enables the construction of confidence intervals for both the mean reward under the current estimate of the optimal TR and the mean reward under the optimal TR itself. The asymptotic variance of the estimator takes the form of the variance of an efficient influence curve at a limiting distribution, which allows us to discuss the efficiency of inference. As a by-product, we also derive confidence intervals on two cumulated pseudo-regrets, a key notion in the study of bandit problems. A simulation study illustrates the procedure. One of the cornerstones of the theoretical study is a new maximal inequality for martingales with respect to the uniform entropy integral.

Similar Papers

Greedy outcome weighted tree learning of optimal personalized treatment rules.
Ruoqing Zhu ... Shuangge Ma
Biometrics | VOL. 73
04 Oct 2016

Learning Individualized Treatment Rules for Multiple-Domain Latent Outcomes
Yuan Chen ... Yuanjia Wang
Journal of the American Statistical Association | VOL. 116
19 Oct 2020

Risk calculators are useful but…
Xiaofei Wang ... Mark F Berry
The Journal of Thoracic and Cardiovascular Surgery | VOL. 151
24 Sep 2015

Contrast weighted learning for robust optimal treatment rule estimation.
Xiaohan Guo ... Ai Ni
Statistics in Medicine | VOL. 41
14 Sep 2022
