Abstract
Improving the reliability of a cloud computing system requires understanding its failure characteristics and predicting failures early enough to avoid them. Statistical analysis of cloud workload data provides insight into these characteristics, which can in turn be used to improve system reliability. This manuscript presents a statistical analysis of task resource usage data from the large Google cluster dataset and develops algorithms to predict task failure. The study shows that failed tasks differ from finished tasks in their resource usage pattern, execution duration, and amount of resources used. Several resampling techniques are combined with an XGBoost classifier to predict task failure on the highly imbalanced dataset; synthetic minority oversampling combined with XGBoost predicted task status with a precision of 92% and a recall of 94.8%.
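The resampling and evaluation steps named above can be illustrated with a minimal sketch. It is not the implementation used in the study: it uses only the Python standard library, omits the XGBoost classifier entirely, and the function names (`smote`, `precision_recall`), the two features (CPU and memory usage), and all data values are hypothetical. It shows only the core idea of synthetic minority oversampling (interpolating between a minority-class point and one of its nearest minority-class neighbours) and how precision and recall are computed for the failed class.

```python
import math
import random


def smote(minority, n_new, k=3, seed=0):
    """Return n_new synthetic points, each interpolated between a random
    minority point and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority points to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority points
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


def precision_recall(y_true, y_pred, positive=1):
    """Precision and recall for the `positive` class (here: failed tasks)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy data: (cpu_usage, memory_usage) of failed tasks.
failed = [(0.90, 0.80), (0.85, 0.75), (0.95, 0.70), (0.80, 0.90)]
extra_failed = smote(failed, n_new=6)

# Hypothetical labels: 1 = failed, 0 = finished.
precision, recall = precision_recall([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
```

In practice the oversampled training set would be passed to a classifier, and precision and recall would be measured on held-out data that has not been resampled, so that the reported scores reflect the original class imbalance.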