A robust discretization method of factor screening for landslide susceptibility mapping using convolution neural network, random forest, and logistic regression models

Zheng Zhao,Jianhua Chen

doi:10.1080/17538947.2023.2174192

Abstract

ABSTRACT The selection of discretization criteria and interval numbers of landslide-related environmental factors generally fails to quantitatively determine or filter, resulting in uncertainties and limitations in the performance of machine learning (ML) methods for landslide susceptibility mapping (LSM). The aim of this study is to propose a robust discretization criterion (RDC) to quantify and explore the uncertainty and subjectivity of different discretization methods. The RDC consists of two steps: raw classification dataset generation and optimal dataset extraction. To evaluate the robustness of the proposed RDC method, Lushan County of Sichuan Province in China was chosen as the study area to generate the LSM based on three datasets (optimal dataset, original dataset with continuous values, and statistical dataset) using three popular ML methods, namely, convolution neural network, random forest, and logistic regression. The results show that the areas under the receiver operating characteristic curve (AUCs) of the optimal dataset for the abovementioned ML models are 0.963, 0.961, and 0.930 which are higher than those of the original dataset (0.938, 0.947, and 0.900) and statistical dataset (0.948, 0.954, and 0.897). In conclusion, the RDC method can extract the more representative features from environmental factors and outperform the other conventional discretization methods.

Full Text