This research explores flood prediction in the Lower Damodar River Basin (LDRB) using a hybrid ensemble of a Naïve Bayes Tree (NBT) and five bivariate statistical models such as Evidential Belief Function (EBF), Index of Entropy (IOE), Frequency Ratio (FR), Statistical Index (SI), and Modified Information Value (MIV). A total of 348 flood locations and 15 conditioning factors including hydrological, topographical and land cover were considered for this analysis. To ensure the precision of model predictions, a multicollinearity assessment was executed. Receiver operating characteristic (ROC) curve, area under curve (AUC), mean absolute error (MAE), mean squared error (MSE), and root mean squared error (RMSE) were performed to compare and asses each of the models all. The results reveal that all models performed well in creating flood hazard maps with AUC >0.8 and RMSE <0.4. FR and EBF models demonstrated the highest predictive accuracy (AUC=0.85), followed by the IOE, SI, and MIV models. The ensemble of NBT with bivariate models shows promising results, showcasing reduced error metrics and improved accuracy for the IOE, SI, and MIV models. This study highlights the potential of ensemble models in flood hazard prediction, offering valuable insights for global flood risk management. The successful application of these data-driven models showcases their importance in forecasting flood risks, aiding decision-makers and planners in developing more effective flood mitigation strategies.
Read full abstract