Abstract

Short-term Load Forecasting (STLF) is the basis of smart distribution network system operation, planning, and dispatching. The traditional linear regression prediction method has the problems of slow prediction speed and low prediction accuracy. In order to solve the problem, an improved regression model based on mini-batch stochastic gradient descent is proposed in this paper. Combined with the big data analysis and processing platform, the collected data is conformed, and the parallel computing model Map-Reduce is used to parallelize mini-batch stochastic gradient descent algorithm for improving the processing ability of mini-batch stochastic gradient descent algorithm in big data load forecasting, and shorten load forecasting time. Meanwhile, in order to clean up the duplicated data and bad data generated by the smart meter and sensor before calculation, an adaptive sorted neighborhood method is proposed to detect the repeatedly recorded data, and the K-means clustering method is used to eliminate the noise data .The experimental results show that the parallelized mini-batch stochastic gradient descent algorithm is much faster than the traditional regression analysis algorithm when the data volume is large. The average absolute percentage error of the load forecasting model for Belgium and a transformer station in Baiyin city of Gansu Province in China is 1.902% and 2.058% respectively, which satisfies the requirements of load forecasting.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call