Accurate wind power curves (WPCs) are crucial for wind energy development and utilization, e.g., wind power forecasting and wind turbine condition monitoring. In the era of big data, large-scale datasets make the training of power curve models inefficient, especially for kernel-based models. Furthermore, most models do not take into account the error characteristics of WPC modeling. In this study, a large-scale generalized kernel-based regression model is proposed to solve the above problem. First, a generalized loss function, which can model both symmetric and asymmetric error distributions, is designed for model training. Then, the Nyström technique is employed to get the approximate kernel matrix, based on which an eigenvalue-based kernel regression framework is constructed. Next, a large-scale generalized kernel-based regression model is developed with model parameters tuned using the alternating direction method of multipliers. Before WPC modeling, a three-step data processing method based on isolation forest is designed to process missing data, irrational data, and outliers in the collected data. The WPC modeling results on four large-scale wind datasets demonstrate that the proposed model generates accurate WPCs with high efficiency. Furthermore, the effect of turbulence intensity on WPC modeling and the effectiveness of LSGKRM with multivariate inputs are also verified.
Read full abstract