Stochastic Control Research Articles

We consider a stochastic lost-sales inventory control system with lead time L over a planning horizon T. Supply is uncertain, and it is a function of the order quantity (because of random yield/capacity, etc.). We aim to minimize the T-period cost, a problem that is known to be computationally intractable even under known distributions of demand and supply. In this paper, we assume that both the demand and supply distributions are unknown and develop a computationally efficient online learning algorithm. We show that our algorithm achieves a regret (i.e., the performance gap between the cost of our algorithm and that of an optimal policy over T periods) of [Formula: see text] when [Formula: see text]. We do so by (1) showing that our algorithm’s cost is higher by at most [Formula: see text] for any [Formula: see text] compared with an optimal constant-order policy under complete information (a widely used algorithm) and (2) leveraging the latter’s known performance guarantee from the existing literature. To the best of our knowledge, a finite sample [Formula: see text] (and polynomial in L) regret bound when benchmarked against an optimal policy is not known before in the online inventory control literature. A key challenge in this learning problem is that both demand and supply data can be censored; hence, only truncated values are observable. We circumvent this challenge by showing that the data generated under an order quantity q2 allow us to simulate the performance of not only q2 but also, q1 for all [Formula: see text], a key observation to obtain sufficient information even under data censoring. By establishing a high-probability coupling argument, we are able to evaluate and compare the performance of different order policies at their steady state within a finite time horizon. Because the problem lacks convexity, commonly used learning algorithms, such as stochastic gradient decent and bisection, cannot be applied, and instead, we develop an active elimination method that adaptively rules out suboptimal solutions. This paper was accepted by Victor Martínez-de-Albéniz, operations management. Funding: This work is supported by the National Science Foundation [Grant CCF-2312205]. Z. Zhou also acknowledges the New York University’s 2024 Center for Global Economy and Business [Faculty Research Grant] and New York University [Research Catalyst Prize]. Supplemental Material: The online appendix is available at https://doi.org/10.1287/mnsc.2022.02476 .

Problem definition: Medical operations require a large volume and variety of consumable supplies that are kept in hospital inventory and replenished on a regular basis. Stringent requirements on the availability of these supplies, together with high variability in their daily usage, contribute to the high inventory costs of the surgical departments in hospitals. We investigate the value of utilizing Advance Booking Information (ABI) on elective surgeries—which are often booked up to months in advance—in reducing inventory costs. Methodology/results: We study a single-item, periodic-review, stochastic inventory control problem, where the item demand in each period is driven by the number and type of surgeries requiring the item, and with the available information on elective surgeries integrated into the ordering decisions. Given that item usage from each case is uncertain and only realized after the surgery, ABI provides imperfect information on future demand. Through exact analysis of a simplified version of the problem, as well as extensive numerical experiments using synthetic and real data, enabled using a state aggregation technique, we provide insights on and quantify the value of using ABI as a function of the number of periods of ABI integrated into the ordering decisions. We identify a relevant parameter regime—namely, high backlog (relative to holding) costs and when surgeries are booked sufficiently in advance—where the value of using ABI could be significant and the majority of the benefits can be gained through incorporating only one period of ABI beyond the order lead time. In a case study conducted using real data, we observe up to 26% reduction in average inventory levels, without violating the service levels. Managerial implications: By incorporating readily available elective surgery schedules into replenishment decisions of surgical supplies, hospitals could significantly reduce inventory costs without compromising the availability of the supplies. Funding: This work was partially funded by The Ontario Ministry of Government and Consumer Services (MGCS). The views expressed in the paper are the views of the authors and do not necessarily reflect those of the Province. Supplemental Material: The e-companion is available at https://doi.org/10.1287/msom.2021.0063 .

Stochastic Control Research Articles

Related Topics

Articles published on Stochastic Control

Existence of solutions to stochastic p(t,x)-Laplace equations and applications

Hybrid stochastic control strategy by two-layer networks for dissipating urban traffic congestion

Ergodic Control of Bipartite Matching Queues with Class Change and Matching Failure

Long Run Stochastic Control Problems with General Discounting

Farm debt and the over-exploitation of natural capital

Output Feedback Stochastic Model Predictive Control for Linear Systems with Convex Optimization Approach

On convergence of occupational measures sets of a discrete-time stochastic control system, with applications to averaging of hybrid systems

Uncertainty-resilient constrained rendezvous trajectory optimization via stochastic feedback control and unscented transformation

Derivation of Dirac equation from the stochastic optimal control principles of quantum mechanics.

Stochastic near-optimal controls for treatment and vaccination in a COVID-19 model with transmission incorporating Lévy jumps

Stochastic optimal control of a coupled tri-stable energy harvester under correlated colored noises

Caputo fractional backward stochastic differential equations driven by fractional Brownian motion with delayed generator

Stochastic LQ optimal control for Markov jumping systems with multiplicative noise using reinforcement learning

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Multi-objective continuous review inventory policy using MOPSO and TOPSIS methods

Markov-Based Stochastic Stabilization Control for MMC-HVDC Systems With Inertia Supporting Under Random Disturbances

Event-Triggered Stochastic Model Predictive Control for Constrained Queueing Networks

Inventory Management with Advance Booking Information: The Case of Surgical Supplies and Elective Surgeries

Deep L1 Stochastic Optimal Control Policies for Planetary Soft Landing

Optimal dividend and proportional reinsurance strategy for the risk model with common shock dependence

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Stochastic Control Research Articles

Related Topics

Articles published on Stochastic Control

Existence of solutions to stochastic p(t,x)-Laplace equations and applications

Hybrid stochastic control strategy by two-layer networks for dissipating urban traffic congestion

Ergodic Control of Bipartite Matching Queues with Class Change and Matching Failure

Long Run Stochastic Control Problems with General Discounting

Farm debt and the over-exploitation of natural capital

Output Feedback Stochastic Model Predictive Control for Linear Systems with Convex Optimization Approach

On convergence of occupational measures sets of a discrete-time stochastic control system, with applications to averaging of hybrid systems

Uncertainty-resilient constrained rendezvous trajectory optimization via stochastic feedback control and unscented transformation

Derivation of Dirac equation from the stochastic optimal control principles of quantum mechanics.

Stochastic near-optimal controls for treatment and vaccination in a COVID-19 model with transmission incorporating Lévy jumps

Stochastic optimal control of a coupled tri-stable energy harvester under correlated colored noises

Caputo fractional backward stochastic differential equations driven by fractional Brownian motion with delayed generator

Stochastic LQ optimal control for Markov jumping systems with multiplicative noise using reinforcement learning

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Multi-objective continuous review inventory policy using MOPSO and TOPSIS methods

Markov-Based Stochastic Stabilization Control for MMC-HVDC Systems With Inertia Supporting Under Random Disturbances

Event-Triggered Stochastic Model Predictive Control for Constrained Queueing Networks

Inventory Management with Advance Booking Information: The Case of Surgical Supplies and Elective Surgeries

Deep L1 Stochastic Optimal Control Policies for Planetary Soft Landing

Optimal dividend and proportional reinsurance strategy for the risk model with common shock dependence