Abstract

The Multi-armed Bandit algorithm is a consequential tool for informed decision-making: rather than relying on intuitive selections, it systematically assesses the available alternatives in order to identify the most promising outcome. Among its algorithmic variants, the Stochastic Stationary Bandit plays a foundational and enduring role, with versatile applications across diverse domains including digital advertising, price optimization, and recommendation systems. With these considerations in view, the present study undertakes a comprehensive examination of this subject. This paper reviews the Explore-Then-Commit algorithm, the Upper Confidence Bound algorithm, and the Thompson Sampling algorithm, explaining and comparing their formulations, features, and expected results. The Explore-Then-Commit algorithm has a distinct phase in which it explores all choices uniformly. The Upper Confidence Bound algorithm makes decisions by computing an upper confidence index, an optimistic overestimate of each choice's value. The Thompson Sampling algorithm relies on randomness to make its choices. The Explore-Then-Commit algorithm faces the problem of deciding when to explore and when to stop exploring; the Upper Confidence Bound and Thompson Sampling algorithms avoid this problem by dispensing with a separate exploration phase. The Multi-armed Bandit algorithm can handle the process of displaying items of potential interest to users in a recommendation system, the delivery of resources in resource allocation, or the maximization of revenue in a business.
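
As a minimal illustrative sketch, the three strategies can be contrasted on a simulated Bernoulli bandit. The arm means, horizon, and exploration length m below are hypothetical choices for demonstration, not values taken from the study:

```python
# A minimal sketch contrasting the three strategies on a Bernoulli bandit.
# MEANS, HORIZON, and m are illustrative assumptions, not values from the paper.
import math
import random

MEANS = [0.3, 0.5, 0.7]   # hypothetical true success probability of each arm
HORIZON = 10_000          # total number of rounds to play

def pull(arm):
    """Simulate one Bernoulli reward from the chosen arm."""
    return 1.0 if random.random() < MEANS[arm] else 0.0

def explore_then_commit(m=100):
    """Explore every arm m times uniformly, then commit to the best empirical mean."""
    k = len(MEANS)
    sums = [0.0] * k
    for t in range(m * k):
        arm = t % k                       # distinct uniform exploration phase
        sums[arm] += pull(arm)
    best = max(range(k), key=lambda a: sums[a] / m)
    return sum(sums) + sum(pull(best) for _ in range(HORIZON - m * k))

def ucb1():
    """Each round, play the arm with the largest optimistic upper confidence index."""
    k = len(MEANS)
    counts, sums = [0] * k, [0.0] * k
    total = 0.0
    for t in range(1, HORIZON + 1):
        if t <= k:
            arm = t - 1                   # play each arm once to initialize
        else:
            arm = max(range(k), key=lambda a: sums[a] / counts[a]
                      + math.sqrt(2 * math.log(t) / counts[a]))
        r = pull(arm)
        counts[arm] += 1
        sums[arm] += r
        total += r
    return total

def thompson_sampling():
    """Sample a plausible mean from each arm's Beta posterior; play the argmax."""
    k = len(MEANS)
    alpha, beta = [1] * k, [1] * k        # uniform Beta(1, 1) priors
    total = 0.0
    for _ in range(HORIZON):
        arm = max(range(k), key=lambda a: random.betavariate(alpha[a], beta[a]))
        r = pull(arm)
        alpha[arm] += int(r)              # posterior update on a success...
        beta[arm] += 1 - int(r)           # ...or on a failure
        total += r
    return total

if __name__ == "__main__":
    for name, run in [("Explore-Then-Commit", explore_then_commit),
                      ("UCB1", ucb1),
                      ("Thompson Sampling", thompson_sampling)]:
        print(f"{name}: total reward ~ {run():.0f} over {HORIZON} rounds")
```

The sketch makes the structural contrast concrete: only Explore-Then-Commit has a phase boundary to tune (its argument m), while UCB1 and Thompson Sampling blend exploration into every round, through the optimistic index and the posterior sampling respectively.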
