Abstract

This study applied two bivariate statistical models (frequency ratio and information value), one multivariate statistical model (logistic regression), and two supervised statistical learning models (boosted regression trees and classification and regression trees) for mapping flood proneness in an arid region of southern Iraq. For this purpose, ten flood causative factors were chosen based on data availability and local conditions along with the spatial extent of the large flood that affected the study area on 13 May 2013. The factors used involved topography-related factors (elevation, slope, curvature, topographic wetness index, and stream power index), lithology, soil, land use/land cover, the average of annual rainfall, and distance to rivers. The multicollinearity test proved that there was no multicollinearity problem among the factors used. Investigating the worth of factors in building the models using information gain ratio showed that the most important factors that play a major role in controlling flood proneness were elevation, followed by annual rainfall average, distance to rivers, land use/land cover, lithology, and soil. The models were employed using the most important factors to get flood proneness maps. The values of flood proneness were categorized into five classes using a quantile classification scheme. For validating the models, area under the receiver operating characteristic curve (AUC) was used. The AUC for prediction data set was 0.793, 0.786, 0.779, 0.754, and 0.753 for classification and regression trees, boosted regression trees, logistic regression, information value, and frequency ratio, respectively. For the best performance model (classification and regression trees), the areas occupied by flood proneness zones were 2735 km2, 2809 km2, 2816 km2, 2732 km2, and 2801 km2, for very low, low, moderate, high, and very high flood proneness zones, respectively. The main conclusion is that the machine learning models are optimal in mapping flood proneness in the study area, followed by the multivariate and bivariate models. Decision makers and hydrologists for improved management of access floodwater and prevention of flood-related damages can adopt the flood proneness maps developed in this study.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call