Optimal Classifier to Detect Unit of Measure Inconsistency in Gas Turbine Sensors

Lucrezia Manservigi,Javier Artal De La Iglesia,Giovanni Bechini,Mauro Venturini,Enzo Losi

doi:10.3390/machines10040228

Lucrezia Manservigi, Javier Artal De La Iglesia + Show 3 more

Open Access

https://doi.org/10.3390/machines10040228

Copy DOI

Journal: Machines	Publication Date: Mar 24, 2022
Citations: 5	License type: CC BY 4.0

Affiliation: University of Ferrara, Siemens (Germany)

Abstract

Label noise is a harmful issue that arises when data are erroneously labeled. Several label noise issues can occur but, among them, unit of measure inconsistencies (UMIs) are inexplicably neglected in the literature. Despite its relevance, a general and automated approach for UMI detection suitable to gas turbines (GTs) has not been developed yet; as a result, GT diagnosis, prognosis, and control may be challenged since collected data may not reflect the actual operation. To fill this gap, this paper investigates the capability of three supervised machine learning classifiers, i.e., Support Vector Machine, Naïve Bayes, and K-Nearest Neighbors, that are tested by means of challenging analyses to infer general guidelines for UMI detection. Classification accuracy and posterior probability of each classifier is evaluated by means of an experimental dataset derived from a large fleet of Siemens gas turbines in operation. Results reveal that Naïve Bayes is the optimal classifier for UMI detection, since 88.5% of data are correctly labeled with 84% of posterior probability when experimental UMIs affect the dataset. In addition, Naïve Bayes proved to be the most robust classifier also if the rate of UMIs increases.

Full Text