Abstract

Condition-based maintenance (CBM) of industrial machines depends on the continuous, real-time monitoring of the machine’s operational condition via smart sensors attached to different components on the machine. The problem of regularly spaced missing data, which can occur due to a variety of hardware or software issues, is one that is often overlooked in the literature surrounding CBM in industrial machines. Such missing data can cause issues in interpreting the true operational state of the machine, which can reduce the effectiveness of CBM processes. In this paper, we examine the capabilities of five data imputation techniques for handling this regular missing data and examine the impact these techniques have on machine learning (ML) classification algorithms for machine fault diagnosis. We examine the following techniques: simple mean imputation, mean imputation with outliers removed, best and worst-case imputation, and previous day imputation. Each of these methods is configured with the specific parameters that they will only consider data from the previous 24 hours, to ensure that the data is recent, and adequately represents the current status of the machine. The efficacy of each method at accurately reconstructing the missing data and the impact they have on ML classification is recorded in the results. The models are evaluated on a real-world dataset and are evaluated on a variety of common performance metrics.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call