Abstract

Anomaly detection for wind turbine condition monitoring is an active area of research within the wind energy operations and maintenance (O & M) community. In this paper three models were compared for multi-megawatt operational wind turbine SCADA data. The models used for comparison were One-Class Support Vector Machine (OCSVM), Isolation Forest (IF), and Elliptical Envelope (EE). Each of these were compared for the same fault, and tested under various different data configurations. IF and EE have not previously been used for fault detection for wind turbines, and OCSVM has not been used for SCADA data. This paper presents a novel method of condition monitoring that only requires two months of data per turbine. These months were separated by a year, the first being healthy and the second unhealthy. The number of anomalies is compared, with a greater number in the unhealthy month being considered correct. It was found that for accuracy IF and OCSVM had similar performances in both training regimes presented. OCSVM performed better for generic training, and IF performed better for specific training. Overall, IF and OCSVM had an average accuracy of 82% for all configurations considered, compared to 77% for EE.

Highlights

  • Wind turbines generate a large proportion of the UK electricity demand

  • The National Renewable Energy Laboratory has stated that operational expenditure (OPEX) costs for U.S offshore wind energy is around £62-187/kW/year [1], this can be as much as 25 to 30% of the cost of a wind farm itself [2]

  • Isolation Forest has not previously been used for fault detection in wind turbines, only as an outlier detection technique for cleaning the power curve, and whilst One-Class Support Vector Machine (OCSVM) has been used for fault detection, this has not been done for SCADA data

Read more

Summary

Introduction

Wind turbines generate a large proportion of the UK electricity demand. Operations and maintenance (O&M) has become a more significant area in the wind industry, especially in terms of cost of energy. The use of only two months of data was a request from the industrial partner to investigate whether there is a requirement to store and analyse long term data, or if it were possible to perform health classification using less data The advantages of this would be the shorter run time for training and testing, and less storage being required to keep the data over such long periods for immediate access and analysis. This approach is a comparative assessment of the performance of several models using such restricted data, to find if the turbines are operating in a healthy condition or not.

Literature Review
Previous Examples of the Models Examined in this Paper
Literature Review Summary
Anomaly Detection Method
Feature Selection
Data Normalisation
Model Description
Test Description
Table of Accuracies
Selected Results
Analysis of Condition Monitoring Method
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call