Efficient modeling and optimizing of checkpointing in concurrent component-based software systems

Noor Bajunaid, Daniel A Menascé

doi:10.1016/j.jss.2018.01.032

Open Access

https://doi.org/10.1016/j.jss.2018.01.032

Journal: Journal of Systems and Software
Publication Date: Feb 2, 2018
Citations: 6
License type: publisher-specific-oa

Affiliation: George Mason University

Abstract

A common mechanism to improve availability and performance is checkpointing and rollback. When it is time to checkpoint, a system stores a job’s state to nonvolatile memory, and, when a failure occurs, it rolls back to the latest stored state instead of restarting the job from the beginning, thus improving performance in the presence of failures. Checkpointing too frequently reduces the amount of work to be redone after a failure but generates excessive overhead, degrading performance. This paper presents a novel and very efficient queuing network model that addresses software component contention for hardware resources and shows how it can be used to model checkpointing in heterogeneous component-based software systems. We validated this model against a previous model, developed by the authors, that used Markov chains. Our new model is orders of magnitude faster than the previous one and can be used to plan for checkpointing at run time. As an additional contribution of this paper, we present an optimizer that finds, for each software component, the optimal checkpointing interval that minimizes execution time, maximizes availability, or minimizes checkpointing overhead.
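The trade-off described in the abstract can be illustrated with a small sketch. This is not the queuing network model or the optimizer from the paper, whose details are not given here. It uses the classic first-order approximation attributed to Young, which ignores resource contention between components and assumes a single job, a constant checkpoint cost, and exponentially distributed failures. The function names and the numeric values (a 5-second checkpoint cost and a 1-hour mean time between failures) are hypothetical and chosen only for illustration.

```python
import math


def overhead_fraction(tau, checkpoint_cost, mtbf):
    """First-order overhead per unit of useful work (an assumed simple model).

    checkpoint_cost / tau : time spent writing checkpoints
    tau / (2 * mtbf)      : expected rework, about half an interval per failure
    """
    return checkpoint_cost / tau + tau / (2.0 * mtbf)


def young_interval(checkpoint_cost, mtbf):
    """Interval that minimizes overhead_fraction (Young's approximation)."""
    return math.sqrt(2.0 * checkpoint_cost * mtbf)


def grid_search_interval(checkpoint_cost, mtbf, candidates):
    """Brute-force check: pick the candidate interval with the lowest overhead."""
    return min(candidates,
               key=lambda tau: overhead_fraction(tau, checkpoint_cost, mtbf))


# Hypothetical values, in seconds.
checkpoint_cost = 5.0
mtbf = 3600.0

tau_star = young_interval(checkpoint_cost, mtbf)
candidates = [float(t) for t in range(10, 1001, 10)]
tau_grid = grid_search_interval(checkpoint_cost, mtbf, candidates)

for tau in (10.0, tau_star, 1000.0):
    print(f"interval={tau:8.1f}s  "
          f"overhead={overhead_fraction(tau, checkpoint_cost, mtbf):.4f}")
```

In this simplified model, very short intervals are dominated by checkpoint-writing time and very long intervals by rework after failures, so the overhead is lowest somewhere in between. A per-component optimizer of the kind the abstract describes would additionally have to account for contention among components for shared hardware, which this sketch leaves out.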

