Abstract. In the domain of landslide risk science, landslide susceptibility mapping (LSM) is very important, as it helps spatially identify potential landslide-prone regions. This study used a statistical ensemble model (frequency ratio and evidence belief function) and two machine learning (ML) models (random forest and XGBoost; eXtreme Gradient Boosting) for LSM in the province of Belluno (region of Veneto, northeastern Italy). The study investigated the importance of the conditioning factors in predicting landslide occurrences using the mentioned models. In this paper, we evaluated the importance of the conditioning factors in the overall prediction capabilities of the statistical and ML algorithms. By the trial-and-error method, we eliminated the least “important” features by using a common threshold of 0.30 for statistical and 0.03 for ML algorithms. Conclusively, we found that removing the least important features does not impact the overall accuracy of LSM for all three models. Based on the results of our study, the most commonly available features, for example, the topographic features, contributes to comparable results after removing the least important ones, namely the aspect plan and profile curvature, topographic wetness index (TWI), topographic roughness index (TRI), and normalized difference vegetation index (NDVI) in the case of the statistical model and the plan and profile curvature, TWI, and topographic position index (TPI) for ML algorithms. This confirms that the requirement for the important conditioning factor maps can be assessed based on the physiography of the region.
Read full abstract