Abstract
Dynamic pricing with demand learning is a very common problem that retailers face. In this problem, the retailer aims to maximize cumulative revenue collected over a finite time horizon by balancing two objectives: learning demand and maximizing revenue. In their paper “Dynamic Pricing with Unknown Nonparametric Demand and Limited Price Changes,” Perakis and Singhvi study this problem when the retailer makes no parametric assumption on the demand and seeks to reduce the amount of price experimentation because of the potential costs associated with price changes. They construct a pricing policy that uses second order approximations of the unknown demand function and establish when the proposed policy achieves near-optimal rate of regret while making very limited price changes. They also perform extensive numerical experiments to show that their proposed policy substantially improves over existing methods in terms of the total price changes, with comparable performance on the cumulative regret metric.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.