Abstract
Predictive systems use historical and other available data to predict an event. In this paper we tries to compare the power of Artificial Neural Network (ANN) and Decision Tree (DT) in prediction of aerology events with time series streams and events stream using combination of K-means clustering algorithm and Decision Tree C5 algorithm and ANN. We try to find the effective parameters on events occurrences. Firstly, we find the closest time series record for any events; therefore, we have gathered different parameters value when an event is occurring. Using K-means we add a field to dataset which determines the cluster of each record and after that we predict the events using C5 algorithm and ANN. This framework and time series model can predict future events efficiently. We gathered 1961 until 2005 data of aerology organization for Tehran Mehrabad Station. This data contains some fields such as wet bulb, relative humidity, amount of cloud, wind speed and etc. This dataset includes 17 types of events. Using this framework the closest event can be predicted. The C5 method is able to predict events with 79.55% accuracy and ANN with 72.87% accuracy. Applying K-means clustering algorithm the prediction increase to 94.59% for C5 and 92.66% accuracies for ANN. We use 10-fold cross validation to evaluate our prediction rate. This framework is the first estimation in the area of event prediction for a huge dataset of aerology and can be extended in many different datasets in any other environments.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have