Abstract
In practical industrial applications, the need for a large number of accurately labeled training samples is a significant challenge for fault detection tasks. However, labeling all training samples is expensive and prone to labeling errors, especially for early fault detection of wind turbine blades. This paper proposes a labeling bias (LB) hypothesis. Assuming the labeler only needs to label parts of normal samples that are easy to judge, we design a probability ratio least-squares importance fitting (PRL-SIF) method based on variable homogeneity. Unlike other state-of-the-art positive unlabeled (PU) learning methods, PRL-SIF does not require knowledge of the class priors to achieve training. Furthermore, to better handle the multi-dimensional time-series data of wind turbines, we provide a data preprocessing method based on functional analysis to achieve time series feature extraction and dimensionality reduction. The effectiveness and robustness of the proposed method are verified on 23 real-world wind turbine datasets. Experimental results show that the proposed method can achieve nearly 90% accuracy while only labeling 20% of normal samples.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.