Abstract
Short-term load forecasting is a critical task in the smart grid, which can be used to optimize power deployment and reduce power losses. Recurrent neural networks (RNNs) are the most popular deep learning models for short-term load forecasting. However, despite of achieving better forecasting accuracy than the traditional models, the performance of the existing RNN-based load forecasting approaches is still unsatisfactory for practical usage. Therefore, in this work, we have proposed input attention mechanism (IAM) and hidden connection mechanism (HCM) to greatly enhance the accuracy and efficiency of RNN-based load forecasting models. Specifically, we use IAM to assign the importance weights on input layers, which have better performances in both efficiency and accuracy than traditional attention mechanisms. To further enhance the models’ efficiency, HCM is then applied to utilize residual connections to enhance the model’s converging speed. We have applied both IAM and HCM on four state-of-the-art RNN implementations, and then conducted extensive experimental studies on two public datasets. Experimental results show that the proposed RNNs with IAM and HCM models achieve much better performances than the state-of-the-art baselines in both accuracy and efficiency. Ablation studies show that both IAM and HCM are essential to achieve such superior performances.
Highlights
The goal of Short-term load forecasting (STLF) is to forecast the load values in the few hours or days based on historical load values that sometimes consider the weather, temperature and other factors
The Recurrent neural networks (RNNs)-based models with input attention mechanism (IAM) and hidden connection mechanism (HCM) improves the accuracy and significantly decreases the convergent time
The root mean square error (RMSE), mean absolute percentage error (MAPE), and convergent time of Bi-RNN-based models with IAM and HCM decreased by 49.58%, 55.87%
Summary
The goal of Short-term load forecasting (STLF) is to forecast the load values in the few hours or days based on historical load values that sometimes consider the weather, temperature and other factors. Recurrent Neural Networks are an effective method for short-term load forecasting in recent years [3]. The recurrent neural networks can take the temporal correlation of the time series and use historical information of any length. The inherent shortcoming of gradient explosion and gradient vanishing during training in traditional recurrent neural networks is a restraint. In response to the deficiency of traditional recurrent neural network, long short-term memory network (LSTM) is proposed to overcome the disadvantage by the gate mechanisms [4]. The calculation method of the hidden layer can be changed through these mechanisms, thereby effectively combining short-term memory with long-term memory. In [5], LSTM is applied
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.