The smart grid (SG) carries both electricity and data between suppliers and consumers, so the reliability and security of that data play an important role in overall grid management. Adaptive energy management (AEM) is one way to achieve this. This research highlights the big data issues and challenges faced by AEM in SG networks. In this paper, we discuss the most commonly used data processing methods and give a detailed comparison of the outputs of some of them. We consider a dataset of 50,000 instances from consumer smart meters and 10,000 attributes from previous fault data and 12 attributes. The comparison assesses the reliability, stability, and accuracy of the system by examining the graphical plots produced by each method. The accuracy of the linear regression method is 98%; for logistic regression, it is 96%; and for K-Nearest Neighbors, it is 92%. The results show that linear regression gives the highest accuracy of the three methods for prediction analysis of big data in SGs. These findings support the use of these methods in future research in this field.