Abstract

Landslide is one of the natural disasters that cause property damages and human injuries. Landslide hazard predictions are crucial measures to reduce the damages and losses. One of the effective approaches in landslide prediction is landslide susceptibility analysis (LSA). In this article, LSA is carried out on the study area, Penang Island. The imbalanced landslide dataset is the most important issue to be solved in this article, four resampling methods were compared for the training set using random forest (RF) as the basic model. To enhance the credibility of the results, the experiments replicate 10 times, and McNemar’s test was applied to analyse statistical significance of classifier performances for the LSA. The results indicated that the differences between the methods were statistically significant; RF combined with the synthetic minority oversampling technique-edited nearest neighbour (SMOTE-ENN) resampling method proposed in this paper has positive effect in LSA as compared with the other resampling methods. The RF and SMOTE-ENN combined model for the LSA using the min–max normalization method achieved a recall of 0.844 and an F2-score of 0.756. The SMOTE-ENN method had a significant impact on the LSA of the imbalanced data in the study area.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.