Abstract
Forecasting the dynamics of time-varying systems is essential to maintaining the sustainability of the systems.Recent studies have discovered that Recurrent Neural Networks(RNN) applied in the forecasting tasks outperform conventional models that include AutoRegressive Integrated Moving Average(ARIMA).However, due to the structural limitation of vanilla RNN which holds unit-length internal connections, learning the representation of time series with \textit{missing data} can be severely biased. The goal of this paper is to provide a robust RNN architecture against the bias from missing data. We propose Dilated Recurrent Attention Networks(DRAN).The proposed model has a stacked structure of multiple RNNs which layer of each having a different length of internal connections. This structure allows incorporating previous information at different time scales.DRAN updates its state by a weighted average of the layers.In order to focus more on the layer that carries reliable information against bias from missing data, it leverages attention mechanism which learns the distribution of attention weights among the layers.We report that our model outperforms conventional ones with respect to the forecast accuracy from two benchmark datasets, including a real-world electricity load dataset.
Highlights
An inaccurate forecast may pay an expensive price for financial and social deterioration which are unanticipated [3, 4]
Recurrent Neural Networks (RNN), a member of neural networks known for more flexibility with little prior assumptions, has become a standard framework for Short-Term Load Forecasting (STLF) tasks after outperforming conventional forecasting models that include AutoRegressive Integrated Moving Average (ARIMA) [4]
RNN can contribute to mitigating the bias from missing data by relying more on the previous information rather than the current missing data, as the internal connections play a role of memory
Summary
An inaccurate forecast may pay an expensive price for financial and social deterioration which are unanticipated [3, 4]. Since the reliability of the forecast has a strong impact on the economic feasibility of industry [1], Short-Term Load Forecasting (STLF) in time-varying systems has been explored actively Still, this is a difficult task as it depends on the nature of the system and external influences. RNN can contribute to mitigating the bias from missing data by relying more on the previous information rather than the current missing data, as the internal connections play a role of memory. We propose DRAN, a novel framework tailored for STLF tasks with missing data This inherits the properties of Dilated RNN (DRNN) [5], featured by a multi-layer and cell-independent architecture, where each layer has a different internal connection, referred to dilation. The model we suggest is readily applicable to other types of tasks but we limit ourselves to STLF tasks in this paper
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Proceedings of the Northern Lights Deep Learning Workshop
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.