Abstract

Background: Through the analysis of the relevant data of industrial equipment, faults diagnosis is helpful for system maintenance and reducing economic losses. Objective: The aim is to reduce the influence of irrelevant features and efficiently train the FIR-XgBoost model. Methods: An Extreme Gradient Boosting (XgBoost) approach based on feature importance ranking (FIR) is proposed in this article for fault classification of high- dimensional complex industrial systems. Gini index is applied to rank the importance of the features, and feature selection is implemented based on their position in the ranking. Results: The dataset from the PHM 2021 data challenge, which is related to the process of fuse thermal imaging, is used. The classification accuracy of FIR-XgBoost reaches 99.63%, outperforming other existing algorithms. A case study is presented to show that excellent fault classification can be achieved through ensemble learning and feature selection. Conclusion: Data-driven machine learning methods are proposed for solving high-dimensional fault classification problems on the dataset of the PHM2021 Data Challenge. An FIR-XgBoost method is proposed, the core of which is to retain important features and to reduce redundancy of sensor data. Consequently, feature selection based on FIR has better interpretability than other algorithms. Furthermore, the FIR- XgBoost algorithm retaining the 50 most important features achieves the best fault classification performance among the compared algorithms and can be implemented in specific industrial processes.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call