SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits

Radu Ciucanu,Marta Soare,Pascal Lafourcade,Gael Marcadet

doi:10.1613/jair.1.13163

Abstract

The multi-armed bandit is a reinforcement learning model where a learning agent repeatedly chooses an action (pull a bandit arm) and the environment responds with a stochastic outcome (reward) coming from an unknown distribution associated with the chosen arm. Bandits have a wide-range of application such as Web recommendation systems. We address the cumulative reward maximization problem in a secure federated learning setting, where multiple data owners keep their data stored locally and collaborate under the coordination of a central orchestration server. We rely on cryptographic schemes and propose Samba, a generic framework for Secure federAted Multi-armed BAndits. Each data owner has data associated to a bandit arm and the bandit algorithm has to sequentially select which data owner is solicited at each time step. We instantiate Samba for five bandit algorithms. We show that Samba returns the same cumulative reward as the nonsecure versions of bandit algorithms, while satisfying formally proven security properties. We also show that the overhead due to cryptographic primitives is linear in the size of the input, which is confirmed by our proof-of-concept implementation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Artificial Intelligence Research	Publication Date: Feb 23, 2022
Citations: 4	License type: cc-by

R Discovery Prime

R Discovery Prime

SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research

Lead the way for us

Similar Papers

Secure protocols for cumulative reward maximization in stochastic multi-armed bandits
Radu Ciucanu ... Pascal Lafourcade
Journal of Computer Security | VOL. 31
Radu Ciucanu, et. al.Radu Ciucanu ... Pascal Lafourcade
26 Jan 2023
Journal of Computer Security | VOL. 31

Secure Outsourcing of Multi-armed Bandits
Radu Ciucanu ... Marius Lombard-Platet
-
Radu Ciucanu, et. al.Radu Ciucanu ... Marius Lombard-Platet
30 Sep 2020
30 Sep 2020

SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits (Extended Abstract)
Radu Ciucanu ... Marta Soare
-
Radu Ciucanu, et. al.Radu Ciucanu ... Marta Soare
01 Aug 2023
01 Aug 2023

Regulation of exploration for simple regret minimization in Monte-Carlo tree search
Yun-Ching Liu ... Yoshimasa Tsuruoka
-
Yun-Ching Liu, et. al.Yun-Ching Liu ... Yoshimasa Tsuruoka
01 Aug 2015
01 Aug 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research