Abstract

Numerical weather prediction (NWP) data possess internal inaccuracies, such as low NWP wind speed corresponding to high actual wind power generation. This study is intended to reduce the negative effects of such inaccuracies by proposing a pure data-selection framework (PDF) to choose useful data prior to modeling, thus improving the accuracy of day-ahead wind power forecasting. Briefly, we convert an entire NWP training dataset into many small subsets and then select the best subset combination via a validation set to build a forecasting model. Although a small subset can increase selection flexibility, it can also produce billions of subset combinations, resulting in computational issues. To address this problem, we incorporated metamodeling and optimization steps into PDF. We then proposed a design and analysis of the computer experiments-based metamodeling algorithm and heuristic-exhaustive search optimization algorithm, respectively.Experimental results demonstrate that (1) it is necessary to select data before constructing a forecasting model; (2) using a smaller subset will likely increase selection flexibility, leading to a more accurate forecasting model; (3) PDF can generate a better training dataset than similarity-based data selection methods (e.g., K-means and support vector classification); and (4) choosing data before building a forecasting model produces a more accurate forecasting model compared with using a machine learning method to construct a model directly.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call