Abstract
To address the data preprocessing requirements and the cost of labeled data that arise in the intelligent operation and maintenance of electromechanical equipment, this article studies structured data cleaning methods and a fault prediction algorithm for settings with only a small number of labeled samples. First, the article introduces the overall architecture of the intelligent operation and maintenance system for electromechanical equipment. Second, building on data access, data cleaning, and fault prediction services for operation and maintenance data, it constructs an intelligent operation and maintenance platform based on a Kafka message queue, a Spark cluster, and other components, and introduces the functional composition of the system in detail. Finally, the article describes in detail the functions of each component of the data access service, data cleaning, and fault prediction. To reduce the cost of obtaining enough labeled samples for data analysis, we propose a semi-supervised learning algorithm, IF-GBDT, based on an improved isolation forest and a Gradient Boosting Decision Tree (GBDT). The isolation forest algorithm assigns labels to unlabeled data based on what it learns from a small number of labeled samples. The gradient boosting decision tree is then trained on the newly labeled data set for fault prediction, which reduces the impact of missing labels on the accuracy of the prediction model. Experiments show that this method improves classification accuracy, adapts well when only a small number of labels are available, and has good concurrency performance.
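The two-stage idea behind IF-GBDT can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the forest is "improved", so the sketch reads the "IF" step as a plain isolation forest (an assumption), picks the anomaly-score cut-off from the few labeled samples (also an assumption), and then trains a very small gradient-boosted model of depth-1 trees on the resulting pseudo-labels. The synthetic data and all function names (`anomaly_scores`, `pseudo_label`, `fit_gbdt`, and so on) are hypothetical.

```python
import math
import random

EULER_GAMMA = 0.5772156649

def avg_path_length(n):
    """Expected path length of an unsuccessful binary-search-tree lookup over n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    return 2.0 * (math.log(n - 1) + EULER_GAMMA) - 2.0 * (n - 1) / n

def grow_tree(rows, depth, max_depth, rng):
    """Grow one isolation tree using random axis-aligned splits."""
    if depth >= max_depth or len(rows) <= 1:
        return {"size": len(rows)}
    f = rng.randrange(len(rows[0]))
    lo, hi = min(r[f] for r in rows), max(r[f] for r in rows)
    if lo == hi:
        return {"size": len(rows)}
    split = rng.uniform(lo, hi)
    left = [r for r in rows if r[f] < split]
    right = [r for r in rows if r[f] >= split]
    return {"f": f, "split": split,
            "left": grow_tree(left, depth + 1, max_depth, rng),
            "right": grow_tree(right, depth + 1, max_depth, rng)}

def path_length(node, x):
    depth = 0
    while "f" in node:
        node = node["left"] if x[node["f"]] < node["split"] else node["right"]
        depth += 1
    return depth + avg_path_length(node["size"])

def anomaly_scores(X, n_trees=50, sample_size=64, seed=0):
    """Isolation-forest score in (0, 1]; higher means easier to isolate."""
    rng = random.Random(seed)
    m = min(sample_size, len(X))
    max_depth = math.ceil(math.log2(m))
    trees = [grow_tree(rng.sample(X, m), 0, max_depth, rng) for _ in range(n_trees)]
    norm = avg_path_length(m)
    scores = []
    for x in X:
        mean_path = sum(path_length(t, x) for t in trees) / n_trees
        scores.append(2.0 ** (-mean_path / norm))
    return scores

def pseudo_label(scores, labels):
    """Assumed rule: choose the cut-off that best separates the labeled samples."""
    known = sorted((scores[i], y) for i, y in labels.items())
    cuts = [(a[0] + b[0]) / 2 for a, b in zip(known, known[1:])]
    t = max(cuts, key=lambda c: sum((s >= c) == bool(y) for s, y in known))
    # Labeled samples keep their true label; the rest are labeled by score.
    return [labels.get(i, int(s >= t)) for i, s in enumerate(scores)]

def fit_stump(X, residuals):
    """Depth-1 regression tree minimising squared error on the residuals."""
    best = None
    for f in range(len(X[0])):
        values = sorted(set(x[f] for x in X))
        for a, b in zip(values, values[1:]):
            t = (a + b) / 2
            left = [r for x, r in zip(X, residuals) if x[f] < t]
            right = [r for x, r in zip(X, residuals) if x[f] >= t]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or sse < best[0]:
                best = (sse, f, t, lm, rm)
    return best[1:]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_gbdt(X, y, n_rounds=20, lr=0.5):
    """Simplified gradient boosting on log loss with stumps as weak learners."""
    p = min(max(sum(y) / len(y), 1e-3), 1 - 1e-3)
    base = math.log(p / (1 - p))
    F = [base] * len(X)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - sigmoid(fi) for yi, fi in zip(y, F)]  # negative gradient
        f, t, lm, rm = fit_stump(X, residuals)
        stumps.append((f, t, lm, rm))
        F = [fi + lr * (lm if x[f] < t else rm) for x, fi in zip(X, F)]
    return base, lr, stumps

def predict(model, x):
    base, lr, stumps = model
    score = base + sum(lr * (lm if x[f] < t else rm) for f, t, lm, rm in stumps)
    return int(score > 0)

# Synthetic stand-in for equipment data: 100 normal readings, 10 scattered faults.
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(100)]
X += [[rng.uniform(5, 10), rng.uniform(5, 10)] for _ in range(10)]
labels = {0: 0, 1: 0, 2: 0, 3: 0, 4: 0, 5: 0, 100: 1, 101: 1, 102: 1, 103: 1}

y_pseudo = pseudo_label(anomaly_scores(X), labels)  # stage 1: supplement labels
model = fit_gbdt(X, y_pseudo)                       # stage 2: train on all samples
print(predict(model, [0.0, 0.0]), predict(model, [8.0, 8.0]))
```

Only 10 of the 110 samples carry a true label here; the boosted model sees the other 100 through their pseudo-labels, so its accuracy depends on how well the anomaly score separates faults from normal readings. In practice a library implementation of both stages would normally replace these hand-written versions.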