Statistical Inference for Online Decision Making via Stochastic Gradient Descent

Haoyu Chen,Wenbin Lu,Rui Song

doi:10.1080/01621459.2020.1826325

Haoyu Chen, Wenbin Lu + Show 1 more

Open Access

https://doi.org/10.1080/01621459.2020.1826325

Copy DOI

Abstract

Online decision making aims to learn the optimal decision rule by making personalized decisions and updating the decision rule recursively. It has become easier than before with the help of big data, but new challenges also come along. Since the decision rule should be updated once per step, an offline update which uses all the historical data is inefficient in computation and storage. To this end, we propose a completely online algorithm that can make decisions and update the decision rule online via stochastic gradient descent. It is not only efficient but also supports all kinds of parametric reward models. Focusing on the statistical inference of online decision making, we establish the asymptotic normality of the parameter estimator produced by our algorithm and the online inverse probability weighted value estimator we used to estimate the optimal value. Online plugin estimators for the variance of the parameter and value estimators are also provided and shown to be consistent, so that interval estimation and hypothesis test are possible using our method. The proposed algorithm and theoretical results are tested by simulations and a real data application to news article recommendation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Statistical Inference for Online Decision Making via Stochastic Gradient Descent

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association

Lead the way for us

Journal: Journal of the American Statistical Association	Publication Date: Nov 19, 2020
Citations: 9

Similar Papers

GEAR: On optimal decision making with auxiliary data
Hengrui Cai ... Rui Song
Stat | VOL. 10
Hengrui Cai, et. al.Hengrui Cai ... Rui Song
21 Jul 2021
Stat | VOL. 10

The evidence interval and the Bayesian evidence value: On a unified theory for Bayesian hypothesis testing and interval estimation.
Riko Kelter
British Journal of Mathematical and Statistical Psychology | VOL. 75
Riko KelterRiko Kelter
01 Mar 2022
British Journal of Mathematical and Statistical Psychology | VOL. 75

Forest Sampling Desk Reference
Evert W Johnson
-
Evert W JohnsonEvert W Johnson
27 Jun 2000
27 Jun 2000

System dynamics perspectives and modeling opportunities for research in operations management
John Sterman ... Kevin Linderman
Journal of Operations Management | VOL. 39-40
John Sterman, et. al.John Sterman ... Kevin Linderman
29 Jul 2015
Journal of Operations Management | VOL. 39-40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Statistical Inference for Online Decision Making via Stochastic Gradient Descent

Abstract

Talk to us

Similar Papers

More From: Journal of the American Statistical Association