Abstract
The prediction of monthly rainfall is greatly beneficial for water resources management and flood control projects. Machine learning (ML) techniques, as an increasingly popular approach, have been applied in diverse climatic regions, showing their respective superiority. On top of that, the ensemble learning model that synthesizes the advantages of different ML models deserves more attention. In this study, an ensemble learning model based on stacking approach was proposed. Four prevalent ML models, namely k-nearest neighbors (KNN), extreme gradient boosting (XGB), support vector regression (SVR), and artificial neural networks (ANN) are taken as base models. To combine the outputs from the base models, the weighting algorithm is used as second-layer learner to generate predictions. Large-scale climate indices, large-scale atmospheric variables, and local meteorological variables were used as predictors. R2, RMSE and MAE, were used as evaluation metrics. The results show that the performance of base models varied among the nine stations in the Taihu Basin, while the stacking approach generally performed better than the four base models. The stacking model showed better performance in spring and winter than in summer and autumn. During wet months, the accuracy of model prediction varied more significantly. On the whole, based on performance evaluation measures, it is concluded that the proposed stacking ensemble multi-ML model can provide a flexible and reasonable prediction framework applicable to other regions.
Highlights
Rainfall is an essential component in the hydrological cycle
The autoregressive model (AR) [11], the autoregressive moving average (ARMA) model [12,13] and the autoregressive moving integrated average (ARIMA) model [14,15] have been widely used for hydrological series predicting
Four base models and the stacking model are constructed at nine stations in the Taihu basin for prediction of monthly rainfall
Summary
Rainfall prediction is a fundamental issue in hydrological application. For modeling precipitation, numerical models based on the physical mechanisms and the statistical models were commonly employed [7,8]. The numerical models are based on the physical equations, including the complex process of atmosphere, ocean and land [9,10]. The statistical model is an approach of acquiring the features of historical rainfall time series and predicting the evolution based on these features. The autoregressive model (AR) [11], the autoregressive moving average (ARMA) model [12,13] and the autoregressive moving integrated average (ARIMA) model [14,15] have been widely used for hydrological series predicting
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.