Control Of Stochastic Systems Research Articles

We consider a stochastic lost-sales inventory control system with lead time L over a planning horizon T. Supply is uncertain, and it is a function of the order quantity (because of random yield/capacity, etc.). We aim to minimize the T-period cost, a problem that is known to be computationally intractable even under known distributions of demand and supply. In this paper, we assume that both the demand and supply distributions are unknown and develop a computationally efficient online learning algorithm. We show that our algorithm achieves a regret (i.e., the performance gap between the cost of our algorithm and that of an optimal policy over T periods) of [Formula: see text] when [Formula: see text]. We do so by (1) showing that our algorithm’s cost is higher by at most [Formula: see text] for any [Formula: see text] compared with an optimal constant-order policy under complete information (a widely used algorithm) and (2) leveraging the latter’s known performance guarantee from the existing literature. To the best of our knowledge, a finite sample [Formula: see text] (and polynomial in L) regret bound when benchmarked against an optimal policy is not known before in the online inventory control literature. A key challenge in this learning problem is that both demand and supply data can be censored; hence, only truncated values are observable. We circumvent this challenge by showing that the data generated under an order quantity q2 allow us to simulate the performance of not only q2 but also, q1 for all [Formula: see text], a key observation to obtain sufficient information even under data censoring. By establishing a high-probability coupling argument, we are able to evaluate and compare the performance of different order policies at their steady state within a finite time horizon. Because the problem lacks convexity, commonly used learning algorithms, such as stochastic gradient decent and bisection, cannot be applied, and instead, we develop an active elimination method that adaptively rules out suboptimal solutions. This paper was accepted by Victor Martínez-de-Albéniz, operations management. Funding: This work is supported by the National Science Foundation [Grant CCF-2312205]. Z. Zhou also acknowledges the New York University’s 2024 Center for Global Economy and Business [Faculty Research Grant] and New York University [Research Catalyst Prize]. Supplemental Material: The online appendix is available at https://doi.org/10.1287/mnsc.2022.02476 .

Read full abstract

In the dynamic business environment of today, decision-making involves considering various conflicting aspects. Inventory planning problems aim to determine how much and when to order products to satisfy customer demand at the lowest possible cost while maintaining a desirable service level. These problems can be formulated as a MOPSO algorithm, which is used to handle multiple objectives in a continuous review stochastic inventory control system (r, Q). Unfortunately, most multi-objective inventory models have been solved by aggregating objectives using specific weights or by optimizing only one objective and treating the others as constraints. Considering the complexity of real-world inventory control problems, which involves conflicting objectives such as minimizing cost and maximizing service level, the need arises to employ more precise optimizers that can generate better and more diverse non-dominated solutions of reorder point and order size system. In this paper multi-criteria decision-making framework that combines MOPSO algorithm and TOPSIS method to generate a Pareto front of non-dominated solutions and rank them based on decision makers' preferences. Initially, the original MOPSO is applied to the multi-objective inventory control problem, and then the mutation operator is integrated into the MOPSO to maintain diversity in the swarm and explore the entire search space. Next, the leader selection strategy called the geographically-based system (Grids) is replaced by the crowding distance factor to choose the global optimal particle as a leader. Additionally, the ε-dominance concept is employed to limit the archive size and maintain more diversity and convergence in the MOPSO for optimizing the inventory control problem. In conclusion, this work not only pioneers a cutting-edge approach to multi objective inventory control but also underscores its practical value. By facilitating the generation of superior solutions that cater to diverse decision-maker preferences, our methodology resonates deeply with real-world challenges and sets a new benchmark for effective inventory planning.

Read full abstract

Control Of Stochastic Systems Research Articles

Articles published on Control Of Stochastic Systems

Existence of a mild solution and approximate controllability for fractional random integro-differential inclusions with non-instantaneous impulses

Fault Isolation and Fault‐Tolerant Control Design for Non‐Gaussian Stochastic Distribution Control Systems With Multiple Sensor Faults

A Lyapunov–Razumikhin control strategy for stochastic nonlinear delayed systems with polynomial conditions

Pth moment asymptotic stability for stochastic complex networked control systems with Lévy noise

Finite time prescribed performance control for stochastic systems with asymmetric error constraint and actuator faults

A Novel Accelerated Multistage Learning Control Mechanism via Virtual Performance Reduction.

Adaptive output feedback control of stochastic systems with mismatched uncertainties input–output quantization

Controllability of partially observed stochastic semilinear fractional control systems

Design stabilisers for multi-input affine control stochastic systems via stochastic control Lyapunov functions

Disturbance observer based fixed-time control of stochastic systems

On convergence of occupational measures sets of a discrete-time stochastic control system, with applications to averaging of hybrid systems

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Multi-objective continuous review inventory policy using MOPSO and TOPSIS methods

Optimal Impulse Control for Systems Deriven by Stochastic Delayed Differential Equations

A model of the radar situation of the coastal vessel traffic control system

New exploration on approximate controllability of fractional neutral‐type delay stochastic differential inclusions with non‐instantaneous impulse

The second-order maximum principle for partially observed optimal controls

Risk-Aware MPC for Stochastic Systems with Runtime Temporal Logics

A General Maximum Principle for Discrete Fractional Stochastic Control System of Mean‐Field Type

A stochastic maximum principle for forward–backward stochastic control systems with quadratic generators and sample-wise constraints

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Control Of Stochastic Systems Research Articles

Articles published on Control Of Stochastic Systems

Existence of a mild solution and approximate controllability for fractional random integro-differential inclusions with non-instantaneous impulses

Fault Isolation and Fault‐Tolerant Control Design for Non‐Gaussian Stochastic Distribution Control Systems With Multiple Sensor Faults

A Lyapunov–Razumikhin control strategy for stochastic nonlinear delayed systems with polynomial conditions

Pth moment asymptotic stability for stochastic complex networked control systems with Lévy noise

Finite time prescribed performance control for stochastic systems with asymmetric error constraint and actuator faults

A Novel Accelerated Multistage Learning Control Mechanism via Virtual Performance Reduction.

Adaptive output feedback control of stochastic systems with mismatched uncertainties input–output quantization

Controllability of partially observed stochastic semilinear fractional control systems

Design stabilisers for multi-input affine control stochastic systems via stochastic control Lyapunov functions

Disturbance observer based fixed-time control of stochastic systems

On convergence of occupational measures sets of a discrete-time stochastic control system, with applications to averaging of hybrid systems

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Multi-objective continuous review inventory policy using MOPSO and TOPSIS methods

Optimal Impulse Control for Systems Deriven by Stochastic Delayed Differential Equations

A model of the radar situation of the coastal vessel traffic control system

New exploration on approximate controllability of fractional neutral‐type delay stochastic differential inclusions with non‐instantaneous impulse

The second-order maximum principle for partially observed optimal controls

Risk-Aware MPC for Stochastic Systems with Runtime Temporal Logics

A General Maximum Principle for Discrete Fractional Stochastic Control System of Mean‐Field Type

A stochastic maximum principle for forward–backward stochastic control systems with quadratic generators and sample-wise constraints