Abstract

Floods are among the most destructive natural hazards. Therefore, their prediction is pivotal for flood management and public safety. Factors contributing to flood are different for every watershed as they depend upon the characteristics of each watershed. Therefore, this study evaluated the factors contributing to flood and the precise location of high and very high flood susceptibility regions in Karachi. A new ensemble model (LR-SVM-MLP) is introduced to develop the susceptibility map and evaluate influencing factors. This ensemble model was formed by employing a stacking ensemble on Logistic Regression (LR), Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP). A spatial database was generated for the Karachi watershed, which included; twelve conditioning factors as independent variables, 652 flood points and the same number of non-flood points as dependent variables. This data was then randomly divided into 70% and 30% to train and validate models, respectively. To analyse the collinearity among factors and to scrutinize each variable's predictive power, multicollinearity test and Information Gain Ratio were applied, respectively. After training, the models were evaluated on various statistical measures and compared with benchmark models. Results revealed that the proposed ensemble model outperformed Logistic Regression (LR), Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP) and produced a precise and accurate map. Results of ensemble model showed 99% accuracy in training and 98% accuracy in testing datasets. This ensemble model can be used by flood management authorities and the government to contribute to future research studies.

Highlights

  • Floods are considered destructive natural hazards that causes millions of human deaths and result in billions of economic losses worldwide

  • A new ensemble model (LR-SVM-Multi-Layer Perceptron (MLP)) is introduced to develop the susceptibility map and evaluate influencing factors. This ensemble model was formed by employing a stacking ensemble on Logistic Regression (LR), Support Vector Machine (SVM), and Multi-Layer

  • Results revealed that the proposed ensemble model outperformed Logistic Regression (LR), Support Vector Machine (SVM), and Multi-Layer

Read more

Summary

Introduction

Floods are considered destructive natural hazards that causes millions of human deaths and result in billions of economic losses worldwide. The flood threat cannot be put aside because of future climate change Odoh and Chilaka (2012). It is predicted that the majority of the world will be threatened by the flood frequency and intensity in future. Information about inundation can be obtained through different techniques of remote sensing, either airborne or space-borne Schumann and Moller (2015). The generation of susceptibility maps for flash floods is challenging, especially in large regions, as flash floods are quite complicated because they are area-dependent and arise nonlinearly to the variety of Spatio-temporal scales (Ahmadlou et al 2019). Promising results of machine learning models have been reported in literature worldwide for solving problems related to natural hazards

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call