Combinatorial Multi-armed Bandit Research Articles

The Artificial Intelligence of Things (AIoT) is an emerging technology that enables numerous AIoT devices to participate in big data analytics and machine learning (ML) model training, providing various customized intelligent services for industry manufacturing. Federated Learning (FL) empowers AIoT applications with privacy-preserving distributed model training without sharing raw data. However, due to IoT devices’ limited computing and memory resources, existing FL approaches for AIoT applications cannot support efficient large-scale model training. Federated synergy learning (FSyL) is a promising collaborative paradigm that alleviates the computation and communication overhead on resource-constrained AIoT devices via offloading part of the ML model to the edge server for end-to-edge collaborative training. Existing FSyL works neither efficiently address the inter-round device selection to improve model diversity nor determine the intra-round edge association to reduce the training cost, which hinders the applications of FSyL-enable AIoT. Motivated by this issue, this paper first investigates the bottlenecks of executing FSyL in AIoT. It builds an optimization model of joint inter-round device selection and intra-round edge association for balancing model diversity and training cost. To tackle the intractable coupling problem, we present a framework named Online DE vice S elect I on and Ed G e Associatio N for Cost-Diversity Trade-offs FSyL (DESIGN). First, the edge association subproblem is extracted from the original problem, and game theory determines the optimal association decision for an arbitrary device selection. Then, based on the optimal association decision, device selection is modeled as a combinatorial multi-armed bandit (CMAB) problem. Finally, we propose an online mechanism to obtain joint device selection and edge association decisions. The performance of DESIGN is theoretically analyzed and experimentally evaluated on real-world datasets. The results show that DESIGN can achieve up to \(84.3\%\) in cost-saving with an accuracy improvement of \(23.6\%\) compared with the state-of-the-art.

Many Multi-Armed Bandit (MAB) based workers selection schemes have been proposed to select high-quality workers to enhance the quality of tasks. However, in Mobile Crowd Sensing (MCS), a complex mutual effect exists among task requestors, the MCS platforms, and workers. Only considering the interaction of two sides doesn’t make MCS a balanced ecosystem. Therefore, it is urgent to establish a tripartite mutual incentive mechanism to make the MCS system a balanced ecosystem. In this paper, a truth based Three-tier Combinatorial Multi-Armed Bandit (TCMAB) incentive mechanism is proposed for selecting each other to maximize their revenues in MCS. In TCMAB, there exists a three-tier and two-way MAB-based incentive scheme. For the mutual interaction between the platform and the worker, a truth-based CMAB scheme is established for the platform to select high-quality workers, and also a CMAB scheme is proposed for workers to select a “good” platform to report data in order to maximize their revenue. Besides, for the mutual interaction between the task requestor and the platform, the platforms adopt the CMAB-based scheme to select a high-payment task requestor that gives high payment. And a platform selection scheme base on CMAB is also established for task requestors to select the platforms which have lower fees and higher quality. What’s more, we don’t adopt the assumption that platforms get the data quality as soon as they get data, but propose a data quality acquisition scheme based on the truth data discovery and cooperation frequency, which is the base to instruct the three-tier interaction, thus establish a kind of truth-based MCS interaction ecosystems. Simulation results show that the proposed TCMAB provides an effective solution for the problem of information elicitation without verification (IEWV) in MCS, and can improve the utilities, data quality, and applications quality for MCS, which is not achieved in the previous studies significantly.

Combinatorial Multi-armed Bandit Research Articles

Articles published on Combinatorial Multi-armed Bandit

A survey of the application and technical improvement of the multi-armed bandit

Auction-based client selection for online Federated Learning

DESIGN: Online Device Selection and Edge Association for Federated Synergy Learning-enabled AIoT

CA-Live360: Crowd-assisted transcoding and delivery for live 360-degree video streaming

Cache content placement in the presence of fictitious requests in mmWave 5G IAB networks

Client selection for federated learning using combinatorial multi-armed bandit under long-term energy constraint

A Quality-Aware and Obfuscation-Based Data Collection Scheme for Cyber-Physical Metaverse Systems

Combinatorial Stochastic-Greedy Bandit

Immersive Multimedia Service Caching in Edge Cloud with Renewable Energy

Multi-hop relay selection for underwater acoustic sensor networks: A dynamic combinatorial multi-armed bandit learning approach

Investigation of selection and application of Multi-Armed Bandit algorithms in recommendation system

BTV-CMAB: A Bi-Directional Trust Verification-Based Combinatorial Multiarmed Bandit Scheme for Mobile Crowdsourcing

Thompson Sampling-Based Partially Observable Online Change Detection for Exponential Families

Truthful and Dual-direction Combinatorial Multi-Armed Bandit Scheme to Maximize Profit for Mobile Crowd Sensing

A Differentially Private Approach for Budgeted Combinatorial Multi-armed Bandits

Adaptive Label Propagation for Group Anomaly Detection in Large-Scale Networks

FedAB: Truthful Federated Learning With Auction-Based Combinatorial Multi-Armed Bandit

Spectrum Allocation and User Scheduling Based on Combinatorial Multi-Armed Bandit for 5G Massive MIMO.

Truth based three-tier Combinatorial Multi-Armed Bandit ecosystems for mobile crowdsensing

LEO satellite assisted UAV distribution using combinatorial bandit with fairness and budget constraints.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Combinatorial Multi-armed Bandit Research Articles

Articles published on Combinatorial Multi-armed Bandit

A survey of the application and technical improvement of the multi-armed bandit

Auction-based client selection for online Federated Learning

DESIGN: Online Device Selection and Edge Association for Federated Synergy Learning-enabled AIoT

CA-Live360: Crowd-assisted transcoding and delivery for live 360-degree video streaming

Cache content placement in the presence of fictitious requests in mmWave 5G IAB networks

Client selection for federated learning using combinatorial multi-armed bandit under long-term energy constraint

A Quality-Aware and Obfuscation-Based Data Collection Scheme for Cyber-Physical Metaverse Systems

Combinatorial Stochastic-Greedy Bandit

Immersive Multimedia Service Caching in Edge Cloud with Renewable Energy

Multi-hop relay selection for underwater acoustic sensor networks: A dynamic combinatorial multi-armed bandit learning approach

Investigation of selection and application of Multi-Armed Bandit algorithms in recommendation system

BTV-CMAB: A Bi-Directional Trust Verification-Based Combinatorial Multiarmed Bandit Scheme for Mobile Crowdsourcing

Thompson Sampling-Based Partially Observable Online Change Detection for Exponential Families

Truthful and Dual-direction Combinatorial Multi-Armed Bandit Scheme to Maximize Profit for Mobile Crowd Sensing

A Differentially Private Approach for Budgeted Combinatorial Multi-armed Bandits

Adaptive Label Propagation for Group Anomaly Detection in Large-Scale Networks

FedAB: Truthful Federated Learning With Auction-Based Combinatorial Multi-Armed Bandit

Spectrum Allocation and User Scheduling Based on Combinatorial Multi-Armed Bandit for 5G Massive MIMO.

Truth based three-tier Combinatorial Multi-Armed Bandit ecosystems for mobile crowdsensing

LEO satellite assisted UAV distribution using combinatorial bandit with fairness and budget constraints.