Abstract
This article examines the development and practical applications of the multi-armed bandit algorithm in the current digital era. With the continued growth of online advertising and online learning, information has expanded explosively, making decision optimization crucial. The multi-armed bandit algorithm, a sequential decision-making model, encompasses common algorithms such as the greedy algorithm, the ε-greedy algorithm, the UCB algorithm, and Thompson sampling. Its central aim is to strike the best balance between exploration and exploitation, a fundamental problem in reinforcement learning. The article introduces an internationally released dataset, MovieLens, and elaborates a series of evaluation indicators, including the average number of friends per user, the average number of listened-to artists per user, the average number of movie ratings per user, the average number of tags added per user, content diversity indicators, and statistics on differences in the click-through rates of recommendations for different types of movies. In addition, the article presents the specific methods of literature collection, screening, analysis, and review. Its purpose is to deepen understanding of the multi-armed bandit algorithm and to provide guidance for its future development and wide application across fields.
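As a minimal illustration of the exploration–exploitation balance described above, the following sketch implements the ε-greedy policy on a simulated Bernoulli bandit. The arm reward probabilities, step count, and seed are hypothetical values chosen for demonstration, not drawn from the article or the MovieLens dataset.

```python
import random

def epsilon_greedy_bandit(arm_means, epsilon=0.1, steps=1000, seed=0):
    # Sketch of the epsilon-greedy policy: with probability epsilon,
    # explore a random arm; otherwise exploit the arm with the highest
    # current estimated reward. arm_means are assumed true reward
    # probabilities for the simulation.
    rng = random.Random(seed)
    n = len(arm_means)
    counts = [0] * n          # pulls per arm
    values = [0.0] * n        # running average reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n)                        # explore
        else:
            arm = max(range(n), key=lambda a: values[a])  # exploit
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        # Incremental mean update of the arm's value estimate.
        values[arm] += (reward - values[arm]) / counts[arm]
    return values, counts

values, counts = epsilon_greedy_bandit([0.2, 0.5, 0.8])
```

With a fixed ε, the policy keeps exploring at a constant rate forever; the UCB algorithm and Thompson sampling mentioned above instead shrink exploration automatically as estimates become more certain.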