Survey of dynamic pricing based on Multi-Armed Bandit algorithms

Jiaming Qu

doi:10.54254/2755-2721/37/20230497

Abstract

Dynamic pricing seeks to determine the most optimal selling price for a product or service, taking into account factors like limited supply and uncertain demand. This study aims to provide a comprehensive exploration of dynamic pricing using the multi-armed bandit problem framework in various contexts. The investigation highlights the prevalence of Thompson sampling in dynamic pricing scenarios with a Bayesian backdrop, where the seller possesses prior knowledge of demand functions. On the other hand, in non-Bayesian situations, the Upper Confidence Bound (UCB) algorithm family gains traction due to their favorable regret bounds. As markets often exhibit temporal fluctuations, the domain of non-stationary multi-armed bandits within dynamic pricing emerges as crucial. Future research directions include enhancing traditional multi-armed bandit algorithms to suit online learning settings, especially those involving dynamic reward distributions. Additionally, merging prior insights into demand functions with contextual multi-armed bandit approaches holds promise for advancing dynamic pricing strategies. In conclusion, this study sheds light on dynamic pricing through the lens of multi-armed bandit problems, offering insights and pathways for further exploration.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Survey of dynamic pricing based on Multi-Armed Bandit algorithms

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Engineering

Lead the way for us

Journal: Applied and Computational Engineering	Publication Date: Feb 7, 2024
License type: cc-by

Similar Papers

Performance variance in Multi-Armed Bandits: In-depth analysis of three core algorithms
Bowen Chen
Applied and Computational Engineering | VOL. 68
Bowen ChenBowen Chen
06 Jun 2024
Applied and Computational Engineering | VOL. 68

Enhancing UCB-tuned and Asymptotically Optimal UCB Algorithms through Weighted Average Techniques in Multi-Armed Bandit Scenarios
Chang Qu
Highlights in Science, Engineering and Technology | VOL. 94
Chang QuChang Qu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

A novel two-stage dynamic pricing model for logistics planning using an exploration–exploitation framework: A multi-armed bandit problem
Mahmoud Tajik ... Rouzbeh Ghousi
Expert Systems with Applications | VOL. 246
Mahmoud Tajik, et. al.Mahmoud Tajik ... Rouzbeh Ghousi
30 Dec 2023
Expert Systems with Applications | VOL. 246

Strategic insights from multi-armed bandits: Applications in real-time strategy games
Yuchen Sun
Applied and Computational Engineering | VOL. 68
Yuchen SunYuchen Sun
06 Jun 2024
Applied and Computational Engineering | VOL. 68

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Survey of dynamic pricing based on Multi-Armed Bandit algorithms

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Engineering