Abstract

A novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification model is trained based on the obtained key process variables. Thirdly, the pseudo label confidence of identification model is dynamically optimized with an historical, current, and future pseudo label confidence mean. To increase the upper limit of the identification model that is self-learning with high entropy process data, active learning is used to identify process data and diagnosis fault causes by ontology. Finally, a PCA-dynamic active safe semi-supervised support vector machine (PCA-DAS4VM) for fault identification in labeled expensive chemical processes is built. The application in the Tennessee Eastman (TE) process shows that this hybrid technology is able to: (i) eliminate chemical process noise and redundant process variables simultaneously, (ii) combine historical pseudo label confidence with future pseudo label confidence to improve the identification accuracy of abnormal working conditions, (iii) efficiently select and diagnose high entropy unlabeled process data, and (iv) fully utilize unlabeled data to enhance the identification performance.

Highlights

  • These methods are still limited to supervised learning, resulting in poor generalization performance, a low industrial fault diagnosis rate (FDR), and a high false positive rate (FPR)

  • This paper proposes a fault identification method principal component analysis (PCA)-DAS4VM based on a graphical scenario object model, which improves the identification accuracy of traditional SVM due to its full use of the unlabeled data distribution information

  • In order to better prove the effectiveness of the proposed method, the PCA-DAS4VM proposed in this paper is compared with the DAS4VM and PCA-S4VM fault identification methods when applied to the Tennessee Eastman (TE)

Read more

Summary

Introduction

Minor abnormality generally refers to abnormal conditions, such as regulator failure or alarms caused by abnormal fluctuations. It is of great theoretical and practical significance to conduct fault identification for chemical process to quickly discover the potential abnormality and to reliably maintain the stationary operation of chemical plants. The existing fault identification methods are mainly divided into: Qualitative methods [2], quantitative methods [3,4], and data-driven methods [5,6]. Among all of the data-driven fault identification methods, the supervised machine learning technique provides impressive fault identification results for the chemical process [7,8]. Mohd Azlan Hussain et al [9] proposed the kernel fisher discriminant

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call