Semi-parametric contextual bandits with graph-Laplacian regularization

Young-Geun Choi, Gi-Soo Kim, Seunghoon Paik, Myunghee Cho Paik

doi:10.1016/j.ins.2023.119367

Abstract

Non-stationarity is ubiquitous in human behavior, and addressing it in contextual bandits is challenging. Several works have addressed the problem by investigating semi-parametric contextual bandits and have warned that ignoring non-stationarity can harm performance. Another prevalent aspect of human behavior is social interaction, which has become available in the form of a social network or graph structure. As a result, graph-based contextual bandits have received much attention. In this paper, we propose SemiGraphTS, a novel contextual Thompson-sampling algorithm for a graph-based semi-parametric reward model. Our algorithm is the first to be proposed in this setting. We derive an upper bound on the cumulative regret that can be expressed as the product of a factor depending on the graph structure and the regret order for the semi-parametric model without a graph. We evaluate the proposed and existing algorithms via simulation and a real-data example.
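
As a rough illustration of what graph-Laplacian regularization means in general, the sketch below builds the combinatorial Laplacian L = D - A of a small user graph and checks that the penalty tr(Θᵀ L Θ) equals the weighted sum of squared differences between the parameter vectors of connected users. This is not the SemiGraphTS algorithm; the three-user graph, the parameter values, and the function names are hypothetical, and the paper may use a different (for example, normalized) Laplacian.

```python
import numpy as np


def graph_laplacian(adjacency):
    """Combinatorial Laplacian L = D - A of an undirected weighted graph."""
    a = np.asarray(adjacency, dtype=float)
    degree = np.diag(a.sum(axis=1))
    return degree - a


def laplacian_penalty(theta, laplacian):
    """tr(Theta^T L Theta), where row i of Theta is user i's parameter vector."""
    return float(np.trace(theta.T @ laplacian @ theta))


# Hypothetical path graph over three users: 0 - 1 - 2.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])
L = graph_laplacian(A)

# Hypothetical per-user parameter vectors (one row per user).
theta = np.array([[1.0, 0.0],
                  [1.0, 1.0],
                  [0.0, 1.0]])

penalty = laplacian_penalty(theta, L)

# Equivalent form: (1/2) * sum_ij A_ij * ||theta_i - theta_j||^2.
pairwise = 0.5 * sum(
    A[i, j] * np.sum((theta[i] - theta[j]) ** 2)
    for i in range(3)
    for j in range(3)
)
```

The penalty is small when users joined by an edge have similar parameters, which is how a graph structure can let connected users share information.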

Journal: Information Sciences
Publication Date: Jun 29, 2023
Citations: 2

Similar Papers

Learning Sparse Causal Gaussian Networks With Experimental Intervention: Regularization and Coordinate Descent
Fei Fu ... Qing Zhou
Journal of the American Statistical Association | VOL. 108
01 Mar 2013

A survey of smoothing techniques based on a backfitting algorithm in estimation of semiparametric additive models
Syed Ejaz Ahmed ... Dursun Aydın
WIREs Computational Statistics | VOL. 15
25 Dec 2022

Bayesian hierarchical graph-structured model for pathway analysis using gene expression data
Hui Zhou ... Tian Zheng
Statistical Applications in Genetics and Molecular Biology | VOL. 12
01 Jan 2013

DOFV distributions: a new diagnostic for the adequacy of parameter uncertainty in nonlinear mixed-effects models applied to the bootstrap.
Anne-Gaëlle Dosne ... Mats O Karlsson
Journal of Pharmacokinetics and Pharmacodynamics | VOL. 43
11 Oct 2016
