Abstract

In this study, the estimation performances of Multiple Linear Regression, Random Forest, and Artificial Neural Network are examined comparatively. For comparison of these data mining techniques, the power production data from a Photovoltaic Module was used in the research. In this study, the model was constituted from seven variables. One of the variables is dependent (power) and the others are independent variables (global radiation, temperature, wind speed, wind direction, relative humidity, solar elevation angle). In this paper, the Mean Absolute Error and the correlation coefficient were used in order to compare the estimation performance of the mentioned data mining techniques. While the correlation coefficient is 0.963 in Multiple Linear Regression model, the correlation coefficient is 0.986 in Random Forest decision tree method. The highest correlation coefficient was obtained in Artificial Neural Network architecture (R = 0.997). According to the three data mining methods, the global radiation was found as the most important predictor. While the least important predictor is the wind direction in both the Artificial Neural Network and the Random Forest models, the solar elevation angle is the least important predictor in the Multiple Linear Regression model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.