To estimate rainfall from remote sensing data, three machine learning-based regression models, K-Nearest Neighbors Regression (K-NNR), Support Vector Regression (SVR), and Random Forest Regression (RFR), were implemented using Meteosat Second Generation (MSG) satellite data. Daytime and nighttime data from a rain gauge were used for model training and validation. To optimize the results, the outputs of the three models were combined using a weighted average. The combination of the three models (hereafter called Com-RSK) markedly improved the predictions: the MAE, MBE, RMSE, and correlation coefficient went from 23.6 mm, 10.0 mm, 40.6 mm, and 89%, respectively, for the SVR to 20.7 mm, 5.5 mm, 37.4 mm, and 94% when the models were combined. Com-RSK was also compared with several methods that rely on classification in the estimation, namely the Enhanced Convective Stratiform Technique (ECST), the MMultic technique, and the Convective/Stratiform Rain Area Delineation Technique (CS-RADT). Com-RSK shows superior performance compared to the ECST, MMultic, and CS-RADT methods. Com-RSK was further compared with two satellite estimation products, CMORPH and CHIRPS. The results indicate that Com-RSK performs better than CMORPH and CHIRPS according to MBE, RMSE, and the correlation coefficient (CC). Finally, a comparison with three types of satellite precipitation estimation products (global, regional, and near real-time) was performed. Overall, the methodology developed here yields almost the same results as regional products and better results than near real-time and global products.
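
As a rough illustration of the combination and evaluation steps described above, the following minimal sketch computes a weighted average of three model outputs and then the four scores (MAE, MBE, RMSE, CC). It is not the authors' implementation: the function names `combine_weighted` and `evaluate`, the weights, and all numeric values are hypothetical placeholders, and the sign convention for MBE (estimate minus observation) is an assumption, since the abstract does not state how the weights were chosen or how the bias was defined.

```python
import numpy as np


def combine_weighted(predictions, weights):
    """Weighted average of per-model predictions.

    predictions: array-like of shape (n_models, n_samples)
    weights:     array-like of shape (n_models,), non-negative, not all zero
    """
    predictions = np.asarray(predictions, dtype=float)
    weights = np.asarray(weights, dtype=float)
    # np.average normalizes by the sum of the weights along axis 0.
    return np.average(predictions, axis=0, weights=weights)


def evaluate(estimated, observed):
    """Return MAE, MBE, RMSE and the Pearson correlation coefficient (CC)."""
    estimated = np.asarray(estimated, dtype=float)
    observed = np.asarray(observed, dtype=float)
    error = estimated - observed  # assumed convention: estimate minus observation
    return {
        "MAE": float(np.mean(np.abs(error))),
        "MBE": float(np.mean(error)),
        "RMSE": float(np.sqrt(np.mean(error ** 2))),
        "CC": float(np.corrcoef(estimated, observed)[0, 1]),
    }


# Made-up placeholder values in mm, NOT data from the study.
observed = np.array([10.0, 20.0, 30.0, 40.0])
preds = np.array([
    [12.0, 18.0, 33.0, 41.0],  # stand-in for K-NNR output
    [14.0, 25.0, 36.0, 47.0],  # stand-in for SVR output
    [9.0, 21.0, 29.0, 42.0],   # stand-in for RFR output
])
weights = np.array([0.3, 0.2, 0.5])  # hypothetical weights

combined = combine_weighted(preds, weights)
scores = evaluate(combined, observed)
print(scores)
```

In practice the weights might be derived from each model's validation error, but that choice is an assumption here rather than something stated in the abstract.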