Abstract

Hard disk drives (HDD) are used for data storage in personal computing platforms as well as commercial datacenters. An abrupt failure of these devices may result in an irreversible loss of critical data. Most HDD use self-monitoring, analysis, and reporting technology (SMART), and record different performance parameters to assess their own health. However, not all SMART attributes are effective at detecting a failing HDD. In this paper, a two-tier approach is presented to select the most effective precursors for a failing HDD. In the first tier, a genetic algorithm (GA) is used to select a subset of SMART attributes that lead to easily distinguishable and well clustered feature vectors in the selected subset. The GA finds the optimal feature subset by evaluating only combinations of SMART attributes, while ignoring their individual fitness. A second tier is proposed to filter the features selected using the GA by evaluating each feature independently, using a significance score that measures the statistical contribution of a feature towards disk failures. The resultant subset of selected SMART attributes is used to train a generative classifier, the naïve Bayes classifier. The proposed method is tested on a SMART dataset from a commercial datacenter, and the results are compared with state-of-the-art methods, indicating that the proposed method has a better failure detection rate and a reasonable false alarm rate. It uses fewer SMART attributes, which reduces the required training time for the classifier and does not require tuning any parameters or thresholds.

Highlights

  • The annual shipments of tablet computing devices surpassed personal computers in 2013, making flash memory the predominant mode of data storage for digital consumers [1]

  • This study proposes a two-tier approach for feature selection using a genetic algorithm (GA) and feature significance function

  • When an naïve Bayes (NB) classifier is trained using the features selected by the proposed two-tier approach, given in Table 2, it achieves an average classification accuracy of 99.01% with a false positive rate (FPR) of 0.24%

Read more

Summary

Introduction

The annual shipments of tablet computing devices surpassed personal computers in 2013, making flash memory the predominant mode of data storage for digital consumers [1]. In [3,4,6,10], the authors analyze logs and replacement records of more than 100, 000 hard disks that were determined to be faulty and replaced by the customers These studies do not investigate the causes of disk failures; rather, they provide an analysis of the data from a reliability perspective, and do not offer any useful clues for predicting a failing HDD.

The Proposed Methodology
Feature Selection
Feature Selection Using a Genetic Algorithm
Feature Selection Using Significance Scores
Classification Using the Naive Bayes Classifier
The SMART Dataset
Results and Discussion
Method
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call