Security Assessment of the Contextual Multi-Armed Bandit - RL Algorithm for Link Adaptation

Mariam El-Sobky,Mervat Abu-Elkheir,Hisham Sarhan

doi:10.1109/niles50944.2020.9257955

Abstract

Industry is increasingly adopting Reinforcement Learning algorithms (RL) in production without thoroughly analyzing their security features. In addition to the potential threats that may arise if the functionality of these algorithms is compromised while in operation. One of the well-known RL algorithms is the Contextual Multi-Armed Bandit (CMAB) algorithm. In this paper, we explore how the CMAB can be used to solve the Link Adaptation problem – a well-known problem in the telecommunication industry by learning the optimal transmission parameters that will maximize a communication link’s throughput. We analyze the potential vulnerabilities of the algorithm and how they may adversely affect link parameters computation. Additionally, we present a provable security assessment for the Contextual Multi-Armed Bandit Reinforcement Learning (CMAB-RL) algorithm in a network simulated environment using Ray. This is by demonstrating CMAB security vulnerabilities theoretically and practically. Some security controls are proposed for CMAB agent and the surrounding environment. In order to fix those vulnerabilities and mitigate the risk. These controls can be applied to other RL agents in order to design more robust and secure RL agents.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Security Assessment of the Contextual Multi-Armed Bandit - RL Algorithm for Link Adaptation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Reinforcement learning with algorithms from probabilistic structure estimation
Jonathan P Epperlein ... Robert Shorten
Automatica | VOL. 144
Jonathan P Epperlein, et. al.Jonathan P Epperlein ... Robert Shorten
06 Aug 2022
Automatica | VOL. 144

Context Enhancement for Linear Contextual Multi-Armed Bandits
Nicolas Gutowski ... Fabien Chhel
-
Nicolas Gutowski, et. al.Nicolas Gutowski ... Fabien Chhel
01 Nov 2018
01 Nov 2018

Using Individual Accuracy to Create Context for Non-Contextual Multi-Armed Bandit Problems
Nicolas Gutowski ... Fabien Chhel
-
Nicolas Gutowski, et. al.Nicolas Gutowski ... Fabien Chhel
01 Mar 2019
01 Mar 2019

Functional Contour-following via Haptic Perception and Reinforcement Learning.
Randall B Hellman ... Veronica J Santos
IEEE Transactions on Haptics | VOL. 11
Randall B Hellman, et. al.Randall B Hellman ... Veronica J Santos
18 Sep 2017
IEEE Transactions on Haptics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Security Assessment of the Contextual Multi-Armed Bandit - RL Algorithm for Link Adaptation

Abstract

Talk to us

Similar Papers