Differentially Private Linear Bandits with Partial Distributed Feedback

Fengjiao Li,Bo Ji,Xingyu Zhou

doi:10.23919/wiopt56218.2022.9930524

Abstract

In this paper, we study the problem of global reward maximization with only partial distributed feedback. This problem is motivated by several real-world applications (e.g., cellular network configuration, dynamic pricing, and policy selection) where an action taken by a central entity influences a large population that contributes to the global reward. However, collecting such reward feedback from the entire population not only incurs a prohibitively high cost, but often leads to privacy concerns. To tackle this problem, we consider differentially private distributed linear bandits, where only a subset of users from the population are selected (called clients) to participate in the learning process and the central server learns the global model from such partial feedback by iteratively aggregating these clients’ local feedback in a differentially private fashion. We then propose a unified algorithmic learning framework, called differentially private distributed phased elimination (DP-DPE), which can be naturally integrated with popular differential privacy (DP) models (including central DP, local DP, and shuffle DP). Furthermore, we prove that DP-DPE achieves both sublinear regret and sublinear communication cost. Interestingly, DP-DPE also achieves privacy protection “for free” in the sense that the additional cost due to privacy guarantees is a lower-order additive term. Finally, we conduct simulations to corroborate our theoretical results and demonstrate the effectiveness of DP-DPE.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Differentially Private Linear Bandits with Partial Distributed Feedback

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

(Private) Kernelized Bandits with Distributed Biased Feedback
Fengjiao Li ... Bo Ji
Proceedings of the ACM on Measurement and Analysis of Computing Systems | VOL. 7
Fengjiao Li, et. al.Fengjiao Li ... Bo Ji
27 Feb 2023
Proceedings of the ACM on Measurement and Analysis of Computing Systems | VOL. 7

(Private) Kernelized Bandits with Distributed Biased Feedback
Fengjiao Li ... Bo Ji
-
Fengjiao Li, et. al.Fengjiao Li ... Bo Ji
19 Jun 2023
19 Jun 2023

How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy
Natalia Ponomareva ... Abhradeep Guha Thakurta
Journal of Artificial Intelligence Research | VOL. 77
Natalia Ponomareva, et. al.Natalia Ponomareva ... Abhradeep Guha Thakurta
23 Jul 2023
Journal of Artificial Intelligence Research | VOL. 77

Bidirectional LSTM-Based Privacy Preserving Method for Trajectory Generation
Xiangjie He ... Wei Jiang
Journal of Intelligence and Knowledge Engineering | VOL. 2
Xiangjie He, et. al.Xiangjie He ... Wei Jiang
01 Jun 2024
Journal of Intelligence and Knowledge Engineering | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Differentially Private Linear Bandits with Partial Distributed Feedback

Abstract

Talk to us

Similar Papers