Abstract

Abstract Aiming at the data preprocessing requirements and label data cost issues arising from the intelligent operation and maintenance of electromechanical equipment, this article mainly studies structured data cleaning methods and fault prediction algorithms for a small number of label samples. First, this article introduces the overall architecture of the intelligent operation and maintenance system for electromechanical equipment. Second, based on the electromechanical equipment operation and maintenance data access service, data cleaning, and fault prediction, this article constructs an electromechanical equipment intelligent operation and maintenance platform based on Kafka message queue, Spark cluster, and other components, and introduces the functional composition of the system in detail. Finally, the article describes the functions of each component of data access service, data cleaning, and fault prediction in detail. To address the cost issue associated with sufficient labeled sample data for data analysis, we propose a semi-supervised learning algorithm, IF-GBDT, based on improved independent forests and Gradient Boosting Decision Tree. The independent forest algorithm supplements labels for unlabeled data based on the learning results of a small number of labeled samples. We also use the gradient lifting tree algorithm to train the model based on the new tag data set for fault prediction, thereby reducing the impact of lack of tags on the accuracy of the prediction model. Experiments show that this method improves classification accuracy and has good adaptability and concurrency performance for a small number of tags.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call