Abstract
Multi-attempt mission aborting systems have recently received significant attention from the reliability community. Existing models mostly assume either parallel or sequential execution of multiple attempts, which incurs high cost or low mission success probability (MSP), respectively. This paper advances the state of the art by considering a new model in which system components may be activated with a certain delay, so that the next component can be activated before the previous one leaves operation, balancing the expected cost of lost components (ECC) against the MSP. Each component may abort its attempt either according to an individual aborting policy defined by two parameters (the number of survived shocks and an operation time threshold) or upon receiving a common abort command. Because components may have different shock resistances and performance rates, their activation order can affect both the MSP and the ECC. We therefore formulate and solve the optimal attempt scheduling and aborting policy (SAP) problem, which determines the vector of component activation times and the individual attempt aborting policy for each component so as to minimize the expected mission losses (EML). The EML, a function of the MSP and the ECC, is evaluated using a new numerical procedure. A detailed case study of a cloud data processing system is provided to demonstrate the proposed model.
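As an illustration only, the two-parameter individual aborting policy can be sketched as a simple decision rule. The abstract does not state how the two parameters are combined, so the sketch assumes a rule common in the mission-abort literature: a component aborts if it has survived at least a given number of shocks while its operation time is still below the threshold. The names `AbortPolicy`, `shock_limit`, `time_threshold`, and `should_abort` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AbortPolicy:
    """Hypothetical container for the two policy parameters named in the abstract."""
    shock_limit: int        # number of survived shocks that triggers an abort
    time_threshold: float   # operation time after which the attempt is no longer aborted


def should_abort(policy: AbortPolicy,
                 shocks_survived: int,
                 operation_time: float,
                 common_abort: bool = False) -> bool:
    """Return True if the component should abort its attempt.

    Assumed rule (not confirmed by the source): abort when the shock count
    has reached the limit before the operation time threshold is passed,
    or whenever a common abort command has been received.
    """
    if common_abort:
        return True
    return (shocks_survived >= policy.shock_limit
            and operation_time < policy.time_threshold)


# Example: a component that aborts after 3 shocks if it has run for less than 10 time units.
policy = AbortPolicy(shock_limit=3, time_threshold=10.0)
early_abort = should_abort(policy, shocks_survived=3, operation_time=4.0)
late_continue = should_abort(policy, shocks_survived=3, operation_time=12.0)
```

Under this assumed rule, a component that has already run past the threshold continues its attempt even after further shocks, unless the common abort command arrives.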