Multi-step forecasting for big data time series based on ensemble learning

A Galicia,R Talavera-Llames,A Troncoso,I Koprinska,F Martínez-Álvarez

doi:10.1016/j.knosys.2018.10.009

Abstract

This paper presents ensemble models for forecasting big data time series. An ensemble composed of three methods (decision tree, gradient boosted trees and random forest) is proposed due to the good results these methods have achieved in previous big data applications. The weights of the ensemble are computed by a weighted least square method. Two strategies related to the weight update are considered, leading to a static or dynamic ensemble model. The predictions for each ensemble member are obtained by dividing the forecasting problem into h forecasting sub-problems, one for each value of the prediction horizon. These sub-problems have been solved using machine learning algorithms from the big data engine Apache Spark, ensuring the scalability of our methodology. The performance of the proposed ensemble models is evaluated on Spanish electricity consumption data for 10 years measured with a 10-minute frequency. The results showed that both the dynamic and static ensembles performed well, outperforming the individual ensemble members they combine. The dynamic ensemble was the most accurate model achieving a MRE of 2%, which is a very promising result for the prediction of big time series. Proposed ensembles are also evaluated using solar power from Australia for two years measured with 30-min frequency. The results are successfully compared with Artificial Neural Network, Pattern Sequence-based Forecasting and Deep Learning, improving their results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-step forecasting for big data time series based on ensemble learning

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Journal: Knowledge-Based Systems	Publication Date: Oct 12, 2018
Citations: 177

Similar Papers

An optimization in big data time series prediction method by Parzen estimation with Spark
Hao Liu
Theoretical and Natural Science | VOL. 18
Hao LiuHao Liu
08 Dec 2023
Theoretical and Natural Science | VOL. 18

Soft computing applications in the field of industrial and environmental enterprises
Alfredo Jimenez ... Álvaro Herrero
Expert Systems | VOL. 36
Alfredo Jimenez, et. al.Alfredo Jimenez ... Álvaro Herrero
01 Aug 2019
Expert Systems | VOL. 36

Dynamic stacking ensemble for cross-language code smell detection.
Hamoud Aljamaan
PeerJ. Computer science | VOL. 10
Hamoud AljamaanHamoud Aljamaan
15 Aug 2024
PeerJ. Computer science | VOL. 10

Distributed Evolutionary Hyperparameter Optimization for Fuzzy Time Series
Petronio C L Silva ... Patricia De Oliveira E Lucas
IEEE Transactions on Network and Service Management | VOL. 17
Petronio C L Silva, et. al.Petronio C L Silva ... Patricia De Oliveira E Lucas
01 Sep 2020
IEEE Transactions on Network and Service Management | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-step forecasting for big data time series based on ensemble learning

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems