Abstract

Machine learning and statistical methods can help model meteorological phenomena, especially in a context with many variables. However, it is not unusual that the measurement of those variables fails, generating data gaps and compromising data history analysis. The framework combines the predictions provided by three machine learning methods: decision trees, artificial neural networks and support vector machine, together with values calculated through five triangulation methods: arithmetic average, inverse distance weighted, optimized inverse distance weighted, optimized normal ratio and regional weight. Each machine learning algorithm generates eight regression models. One of the machine learning models makes predictions based only on the date. The remaining seven models make predictions based on one weather parameter (max. temperature, min. temperature, insolation, among others), in addition to the respective date. The triangulation methods use the climatic data from three neighboring cities to estimate the parameter of the target city. The generated dataset is, posteriorly, optimized by meta-learning algorithms. The results show that the additional information provided by the new machine learning models and the triangulation methods offered a significant increase in the accuracy of the imputed data. Moreover, the statistical analysis and coefficient of determination R² showed that the meta-learning model based on regression trees successfully combined the base-level outputs to generate outputs that best fill in the missing values of the time series studied in this paper.

Highlights

  • IntroductionThe weather is a relevant factor that impacts the management and planning of different areas, such as 984

  • The weather is a relevant factor that impacts the management and planning of different areas, such as 984agriculture, energy generation and heavy civil construction

  • This work presents the application of a meta-learning framework to estimate missing weather values in climatic time series

Read more

Summary

Introduction

The weather is a relevant factor that impacts the management and planning of different areas, such as 984. Agriculture, energy generation and heavy civil construction. This relationship with the weather has motivated several searches in the area aiming to understand it (YANG et al, 2007). To predict the future condition of the weather, both machine learning and predictive analytics consider historical information to learn from past events, trying to recognize patterns. Concerning the elaboration of a study, it is crucial to verify data availability. Using a complete and reliable data set, it is possible to generate studies with fewer errors (BAYMA; PEREIRA, 2018, 2017). Inconsistencies and unsatisfactory data volumes cause a limited or even a false representation of the actual picture (GARCÍA et al, 2009)

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.