Traditionally inventory management models have focused on risk-neutral decision making with the objective of maximizing the expected rewards or minimizing costs over a specified time horizon. However, for items marked by high demand volatility such as fashion goods and technology products, this objective needs to be balanced against the risk associated with the decision. Depending on how the product performs vis-à-vis the seller's original forecast, the seller could end up with losses due to either short or surplus supply. Unfortunately, traditional models do not address this issue. Stochastic dynamic programming models have been extensively used for sequential decision making in the context of multi-period inventory management, but in the traditional way where one either minimizes costs or maximizes profits. Risk is implicitly considered by accounting for stock-out costs. Considering risk and reward simultaneously and explicitly in a stochastic dynamic setting is a cumbersome task and often difficult to implement for practical purposes, since dynamic programming is designed to optimize on one variable, not two. In this paper we develop an algorithm, Variance-Retentive Stochastic Dynamic Programming that tracks variance as well as expected reward in a stochastic dynamic programming model for inventory control. We use the mean–variance solutions in a heuristic, RiskTrackr, to construct efficient frontiers which could be an ideal decision support tool for risk-reward analysis.
Read full abstract