Arm order recognition in multi-armed bandit problem with laser chaos time series

Naoki Narisawa,Makoto Naruse,Nicolas Chauvet,Mikio Hasegawa

doi:10.1038/s41598-021-83726-8

Abstract

By exploiting ultrafast and irregular time series generated by lasers with delayed feedback, we have previously demonstrated a scalable algorithm to solve multi-armed bandit (MAB) problems utilizing the time-division multiplexing of laser chaos time series. Although the algorithm detects the arm with the highest reward expectation, the correct recognition of the order of arms in terms of reward expectations is not achievable. Here, we present an algorithm where the degree of exploration is adaptively controlled based on confidence intervals that represent the estimation accuracy of reward expectations. We have demonstrated numerically that our approach did improve arm order recognition accuracy significantly, along with reduced dependence on reward environments, and the total reward is almost maintained compared with conventional MAB methods. This study applies to sectors where the order information is critical, such as efficient allocation of resources in information and communications technology.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Feb 24, 2021
Citations: 11	License type: open-access

R Discovery Prime

R Discovery Prime

Arm order recognition in multi-armed bandit problem with laser chaos time series

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

An Optimal Algorithm for the Stochastic Bandits While Knowing the Near-Optimal Mean Reward.
Shangdong Yang ... Yang Gao
IEEE transactions on neural networks and learning systems | VOL. 32
Shangdong Yang, et. al.Shangdong Yang ... Yang Gao
01 May 2021
IEEE transactions on neural networks and learning systems | VOL. 32

An instrument validation of TQM enablers and IT resources in Indian ICT organizations
Suby Khanam ... Jamshed Siddiqui
Journal of Systems and Information Technology | VOL. 22
Suby Khanam, et. al.Suby Khanam ... Jamshed Siddiqui
13 Jul 2020
Journal of Systems and Information Technology | VOL. 22

In-depth Exploration and Implementation of Multi-Armed Bandit Models Across Diverse Fields
Jiazhen Wu
Highlights in Science, Engineering and Technology | VOL. 94
Jiazhen WuJiazhen Wu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

DCOPs and bandits: exploration and exploitation in decentralised coordination
...
-
, et. al. ...
04 Jun 2012
04 Jun 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Arm order recognition in multi-armed bandit problem with laser chaos time series

Abstract

Talk to us

Similar Papers

More From: Scientific Reports