Seasonal hydrological forecasts play a critical role in water resources management. The Copernicus Climate Change Service (C3S) data store provides open access to monthly hydrological forecasts for up to six-months. This study aims to evaluate, for the first time, 1- to 3-month runoff forecasts using the European Centre for Medium-Range Weather Forecasts (ECMWF) ensembles of precipitation, runoff, and temperature in 1981–2015 period over a total of 30 s-level basins in Iran. We adopted the 5th, 50th and 95th ECMWF ensemble quantiles for each variable that represent low, medium and high probability of occurrence, respectively. Pearson correlation analysis (Pca), Recursive Feature Elimination (RFE) via random forest (RF) model, and Bayesian Networks (BN) feature selection algorithms were used in order to reduce input variable dimension and select potential predictors to be fed to the machine learning models. Multiple Linear Regression (MLR), Artificial Neural Networks (ANN), Support Vector Regression (SVR), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) machine learning models were used with Repeated K-Fold cross validation (rK-Fold CV) while model efficiency was evaluated using modified Kling-Gupta efficiency coefficient (KGE’), Nash-Sutcliffe Efficiency coefficient (NSE), and Normalized Root Mean Square Error (NRMSE). Results of this study revealed that C3S runoff ensembles have the highest impact on forecast accuracy of streamflow, followed by precipitation and temperature. Overall, model performance yield a best-to-worst ranking of ANN, XGBoost, RF, MLR, and SVR with KGE’ values of 0.70, 0.68, 0.66, 0.57, and 0.41, respectively. The predictive performance of all models decreased with lead times beyond 1-month, where ANN and XGBoost outperformed other models with KGE’ of 0.65 for 2-month lead time and 0.60 for 3-month lead time. The three superior models of XGBoost, ANN, and RF, were employed with RFE and BN FSAs most frequently across Iran’s 30 s level basins in all lead times. Almost all models in the arid central region of Iran showed the lowest performance while highest skills were achieved in the western regions of Iran. Finally, for all models and over all regions, the model performance reduced by increase in lead-time.
Read full abstract