Abstract

The unique characteristics of drainage conditions in the Pagla river basin cause flooding and harm the socioeconomic environment. The main purpose of this study is to investigate the comparative utility of six machine learning algorithms to improve flood susceptibility and ensemble techniques' capability to elucidate the underlying patterns of floods and make a more accurate prediction of flood susceptibilities in the Pagla river basin. In the present scenario, the frequency of flood conditions in this study area becomes high with heavy and sudden rainfall, so it is essential to study flood mitigation and measure. At first, a spatial flood database was built with 200 flood locations and sixteen flood influencing factors, and its process with the help of the Geographic Information System (GIS) environment and build up different models applying the machine learning techniques. It has found different flood susceptibility zone using machine learning-based Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), Reduced Error Pruning Tree (REPTree), Logistic Regression (LR), and Bagging helping GIS environment and the model validation using the Receiver Operating Characteristic (ROC) Curve. Afterward, ensemble all the models to gate comparative accuracy of the flood zone. The calculated areas are under the very high flood susceptibility zone 8.69%, 14.92%, 14.17%, 12.98%, 14.65%, 13.24% and 13.41% for ANN, SVM, RF, REPTree, LR and Bagging, respectively. Finally, ROC curve, the Standard Error (SE), and the Confidence Interval (CI) at 95 per cent were used to assess and compare the performance of the models. The obtained results indicate that all models are highly accepted Area Under Curve (AUC) of ROC between 0.889 (LR) to 0.926 (Ensemble). From the estimation of the accuracy of the applied methods using ROC, it is found that the Ensemble model has the higher capability compared to the other applied models in projecting flood susceptibility in ​the ​study ​area. It has the highest area under the ROC curve the AUC values are 0.918 and 0.926, the SE (0.023, 034), and the narrowest CI (95 per cent) (0.873–0.962, 0.859–0.993) whereas highest area under Bagging (the ROC) curve (AUC) value (0.914, 0.919), for both the training and validation datasets. After ensembling, the result shows that the result is a highly flood susceptible area located at the lower part of the study area. In this area, the very high flood susceptibility zone values lie between 4.46 and 6.00 in the ensemble result. The areas comprise the low height and belong to Murarai I, Murarai II, Suti I and Suti II C.D. block of West Bengal. The current study will help the policymakers and the researcher determine the flood conditioning problems for prospects.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call