Multimodal Dynamic Pricing

Yining Wang,David Simchi-Levi,Boxiao Chen

doi:10.1287/mnsc.2020.3819

Abstract

We consider a single product dynamic pricing with demand learning. The candidate prices belong to a wide range of a price interval; the modeling of the demand functions is nonparametric in nature, imposing only smoothness regularity conditions. One important aspect of our model is the possibility of the expected reward function to be nonconcave and indeed multimodal, which leads to many conceptual and technical challenges. Our proposed algorithm is inspired by both the Upper-Confidence-Bound algorithm for multiarmed bandit and the Optimism-in-the-Face-of-Uncertainty principle arising from linear contextual bandits. The multiarmed bandit formulation arises from local-bin approximation of an unknown continuous demand function, and the linear contextual bandit formulation is then applied to obtain more accurate local polynomial approximators within each bin. Through rigorous regret analysis, we demonstrate that our proposed algorithm achieves optimal worst-case regret over a wide range of smooth function classes. More specifically, for k-times smooth functions and T selling periods, the regret of our proposed algorithm is [Formula: see text], which is shown to be optimal via the development of information theoretical lower bounds. We also show that in special cases, such as strongly concave or infinitely smooth reward functions, our algorithm achieves an [Formula: see text] regret, matching optimal regret established in previous works. Finally, we present computational results that verify the effectiveness of our method in numerical simulations.This paper was accepted by J. George Shanthikumar, big data analytics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal Dynamic Pricing

Abstract

Talk to us

Similar Papers

More From: Management Science

Lead the way for us

Journal: Management Science	Publication Date: Jan 27, 2021
Citations: 31

Similar Papers

Multi-Modal Dynamic Pricing
Yining Wang ... Boxiao Chen
SSRN Electronic Journal | VOL. -
Yining Wang, et. al.Yining Wang ... Boxiao Chen
26 Nov 2019
SSRN Electronic Journal | VOL. -

A novel two-stage dynamic pricing model for logistics planning using an exploration–exploitation framework: A multi-armed bandit problem
Mahmoud Tajik ... Rouzbeh Ghousi
Expert Systems with Applications | VOL. 246
Mahmoud Tajik, et. al.Mahmoud Tajik ... Rouzbeh Ghousi
30 Dec 2023
Expert Systems with Applications | VOL. 246

Survey of dynamic pricing based on Multi-Armed Bandit algorithms
Jiaming Qu
Applied and Computational Engineering | VOL. 37
Jiaming QuJiaming Qu
07 Feb 2024
Applied and Computational Engineering | VOL. 37

Enhancing UCB-tuned and Asymptotically Optimal UCB Algorithms through Weighted Average Techniques in Multi-Armed Bandit Scenarios
Chang Qu
Highlights in Science, Engineering and Technology | VOL. 94
Chang QuChang Qu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal Dynamic Pricing

Abstract

Talk to us

Similar Papers

More From: Management Science