Closing the Gap: A Learning Algorithm for Lost-Sales Inventory Systems with Lead Times

Huanan Zhang, Xiuli Chao, Cong Shi

doi:10.1287/mnsc.2019.3288

Abstract

We consider a periodic-review, single-product inventory system with lost sales and positive lead times under censored demand. In contrast to the classical inventory literature, we assume the firm does not know the demand distribution a priori and makes an adaptive inventory-ordering decision in each period based only on past sales (censored demand) data. The standard performance measure is regret, which is the cost difference between a learning algorithm and the clairvoyant (full-information) benchmark. When the benchmark is chosen to be the (full-information) optimal base-stock policy, Huh et al. [Huh WT, Janakiraman G, Muckstadt JA, Rusmevichientong P (2009a) An adaptive algorithm for finding the optimal base-stock policy in lost sales inventory systems with censored demand. Math. Oper. Res. 34(2):397–416.] developed a nonparametric learning algorithm with a cubic-root convergence rate on regret. An important open question is whether there exists a nonparametric learning algorithm whose regret rate matches the theoretical lower bound for any learning algorithm. In this work, we provide an affirmative answer to this question. More precisely, we propose a new nonparametric algorithm termed the simulated cycle-update policy and establish a square-root convergence rate on regret, which is proven to match the lower bound for any learning algorithm. Our algorithm uses a random cycle-updating rule based on an auxiliary simulated system running in parallel and also involves two new concepts, namely the withheld on-hand inventory and the double-phase cycle gradient estimation. The techniques developed are effective for learning a stochastic system with complex system dynamics and lasting impact of decisions. This paper was accepted by Yinyu Ye, optimization.
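
The regret measure described in the abstract can be written out as a short formula. This is only a sketch under assumed notation: the symbols $T$, $\pi$, $C_t^{\pi}$, $S^*$, and $R_T^{\pi}$ are introduced here for illustration and do not appear in the abstract, and the reading of the two rates assumes they are stated for the time-averaged regret.

```latex
% Assumed notation (not from the abstract):
%   T           number of periods in the planning horizon
%   C_t^{\pi}   cost incurred in period t under a learning algorithm \pi
%   C_t^{S^*}   cost incurred in period t under the full-information
%               optimal base-stock policy with level S^*
\[
  R_T^{\pi} \;=\; \mathbb{E}\!\left[\sum_{t=1}^{T} C_t^{\pi}\right]
            \;-\; \mathbb{E}\!\left[\sum_{t=1}^{T} C_t^{S^*}\right].
\]
% Assumed reading of the rates, in terms of the average regret R_T^{\pi}/T:
%   cubic-root rate:   R_T^{\pi}/T = O(T^{-1/3}),  i.e.  R_T^{\pi} = O(T^{2/3})
%   square-root rate:  R_T^{\pi}/T = O(T^{-1/2}),  i.e.  R_T^{\pi} = O(T^{1/2})
```

Under this reading, moving from the cubic-root to the square-root rate means the cumulative cost gap to the benchmark grows on the order of $\sqrt{T}$ rather than $T^{2/3}$.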

Journal: Management Science
Publication Date: Feb 26, 2017
Citations: 83

Similar Papers

Closing the Gap: A Learning Algorithm for the Lost-Sales Inventory System with Lead Times
Huanan Zhang ... Cong Shi
SSRN Electronic Journal | VOL. -
01 Jan 2017

An Adaptive Algorithm for Finding the Optimal Base-Stock Policy in Lost Sales Inventory Systems with Censored Demand
Woonghee Tim Huh ... Ganesh Janakiraman
Mathematics of Operations Research | VOL. 34
01 May 2009

Bounds on the Solution of the Lagged Optimal Inventory Equation with No Demand Backlogging and Proportional Costs
Thomas E Morton
SIAM Review | VOL. 11
01 Oct 1969

Tailored Base-Surge Policies in Dual-Sourcing Inventory Systems with Demand Learning
Boxiao Chen ... Cong Shi
SSRN Electronic Journal | VOL. -
27 Sep 2019
