Abstract

Suppose that one of two prices for the same product must be posted every day. Under each price, the demand function is described by a compound Poisson process with possibly unknown parameters. The objective is to sequentially post daily prices so as to maximize the total expected, possibly discounted gross revenue over a finite pricing horizon. To effectively balance between understanding the demand function and achieving economic revenues, we formulate the optimal pricing problem with a bandit model and characterize the solution by means of stochastic dynamic programming. When there is only one unknown demand function in the model, the optimal pricing decision is determined by a pricing index, whose limit is the Gittins index. These index values also demonstrate that it may be worth sacrificing some immediate payoff for the benefit of information gathering and better-informed decisions in the future. Moreover, the optimal stopping solution is derived and the myopic strategy is shown not to be optimal in general. When both demand functions are unknown, a version of the play-the-winner pricing rule is derived.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.