Abstract

The past landslides from landslide inventory are essential for machine learning (ML)-based landslide susceptibility assessment (LSA). They determine not only the positive samples (characterized by the past landslides) but also the negative samples (randomly generated based on past landslides) for training and validating the ML models. However, the number of past landslides is often limited because of the constraints of time, budget, and resources available, etc. In other words, the available data for establishing landslide susceptibility ML models are limited, which indicates that the accuracy and reliability of the corresponding models are insufficient. This article, therefore, proposes using a new landslide susceptibility index-based sampling strategy to enhance the positive and negative samples for model training and validation to reach an improved ML-based LSA. To realize this idea, landslide susceptibility analysis based on initial datasets compiled from landslide inventory by using three ML models, i.e., random forest (RF), gradient boosting decision tree (GBDT) and support vector machine (SVM), are first conducted to obtain the initial landslide susceptibility indices at different space locations. Then, the landslide susceptibility indices are analyzed with the proposed sampling strategy which considers directly the grid units with very high and very low landslide susceptibility indices as potential positive and negative samples, respectively, to enrich the initial dataset; and the number of these positive/negative samples is determined by a Bayesian optimization algorithm. Thereafter, the ML models are updated with the enriched datasets. Finally, to verify the effectiveness of the proposed strategy, the improved models are applied to assess the landslide susceptibility of Taojiang County, China, and the results are compared with those from initial corresponding models without updating. The results show that compared with the initial RF, GBDT and SVM models, the corresponding improved models have a better performance in accuracy, precision, recall rate, and specificity. In particular, the AUC values of the three models are increased by 11.94%, 10.72% and 0.57%, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.