Landslide Data Research Articles

Landslide susceptibility mapping has significantly progressed with improvements in machine learning techniques. However, the inventory/data imbalance (DI) problem remains one of the challenges in this domain. This problem exists as a good quality landslide inventory map, including a complete record of historical data, is difficult or expensive to collect. As such, this can considerably affect one’s ability to obtain a sufficient inventory or representative samples. This research developed a new approach based on generative adversarial networks (GAN) to correct imbalanced landslide datasets. The proposed method was tested at Chukha Dzongkhag, Bhutan, one of the most frequent landslide prone areas in the Himalayan region. The proposed approach was then compared with the standard methods such as the synthetic minority oversampling technique (SMOTE), dense imbalanced sampling, and sparse sampling (i.e., producing non-landslide samples as many as landslide samples). The comparisons were based on five machine learning models, including artificial neural networks (ANN), random forests (RF), decision trees (DT), k-nearest neighbours (kNN), and the support vector machine (SVM). The model evaluation was carried out based on overall accuracy (OA), Kappa Index, F1-score, and area under receiver operating characteristic curves (AUROC). The spatial database was established with a total of 269 landslides and 10 conditioning factors, including altitude, slope, aspect, total curvature, slope length, lithology, distance from the road, distance from the stream, topographic wetness index (TWI), and sediment transport index (STI). The findings of this study have shown that both GAN and SMOTE data balancing approaches have helped to improve the accuracy of machine learning models. According to AUROC, the GAN method was able to boost the models by reaching the maximum accuracy of ANN (0.918), RF (0.933), DT (0.927), kNN (0.878), and SVM (0.907) when default parameters used. With the optimum parameters, all models performed best with GAN at their highest accuracy of ANN (0.927), RF (0.943), DT (0.923) and kNN (0.889), except SVM obtained the highest accuracy of (0.906) with SMOTE. Our finding suggests that RF balanced with GAN can provide the most reasonable criterion for landslide prediction. This research indicates that landslide data balancing may substantially affect the predictive capabilities of machine learning models. Therefore, the issue of DI in the spatial prediction of landslides should not be ignored. Future studies could explore other generative models for landslide data balancing. By using state-of-the-art GAN, the proposed model can be considered in the areas where the data are limited or imbalanced.

China is one of the countries where landslides caused the most fatalities in the last decades. The threat that landslide disasters pose to people might even be greater in the future, due to climate change and the increasing urbanization of mountainous areas. A reliable national-scale rainfall induced landslide susceptibility model is therefore of great relevance in order to identify regions more and less prone to landsliding as well as to develop suitable risk mitigating strategies. However, relying on imperfect landslide data is inevitable when modelling landslide susceptibility for such a large research area. The purpose of this study is to investigate the influence of incomplete landslide data on national scale statistical landslide susceptibility modeling for China. In this context, it is aimed to explore the benefit of mixed effects modelling to counterbalance associated bias propagations. Six influencing factors including lithology, slope, soil moisture index, mean annual precipitation, land use and geological environment regions were selected based on an initial exploratory data analysis. Three sets of influencing variables were designed to represent different solutions to deal with spatially incomplete landslide information: Set 1 (disregards the presence of incomplete landslide information), Set 2 (excludes factors related to the incompleteness of landslide data), Set 3 (accounts for factors related to the incompleteness via random effects). The variable sets were then introduced in a generalized additive model (GAM: Set 1 and Set 2) and a generalized additive mixed effect model (GAMM: Set 3) to establish three national-scale statistical landslide susceptibility models: models 1, 2 and 3. The models were evaluated using the area under the receiver operating characteristics curve (AUROC) given by spatially explicit and non-spatial cross-validation. The spatial prediction pattern produced by the models were also investigated. The results show that the landslide inventory incompleteness had a substantial impact on the outcomes of the statistical landslide susceptibility models. The cross-validation results provided evidence that the three established models performed well to predict model-independent landslide information with median AUROCs ranging from 0.8 to 0.9. However, although Model 1 reached the highest AUROCs within non-spatial cross-validation (median of 0.9), it was not associated with the most plausible representation of landslide susceptibility. The Model 1 modelling results were inconsistent with geomorphological process knowledge and reflected a large extent the underlying data bias. The Model 2 susceptibility maps provided a less biased picture of landslide susceptibility. However, a lower predicted likelihood of landslide occurrence still existed in areas known to be underrepresented in terms of landslide data (e.g., the Kuenlun Mountains in the northern Tibetan Plateau). The non-linear mixed-effects model (Model 3) reduced the impact of these biases best by introducing bias-describing variables as random effects. Among the three models, Model 3 was selected as the best national-scale susceptibility model for China as it produced the most plausible portray of rainfall induced landslide susceptibility and the highest spatially explicit predictive performance (median AUROC of spatial cross validation 0.84) compared to the other two models (median AUROCs of 0.81 and 0.79, respectively). We conclude that ignoring landslide inventory-based incompleteness can entail misleading modelling results and that the application of non-linear mixed-effect models can reduce the propagation of such biases into the final results for very large areas.

Landslide Data Research Articles

Related Topics

Articles published on Landslide Data

A New Integrated Approach for Landslide Data Balancing and Spatial Prediction Based on Generative Adversarial Networks (GAN)

Landslide Susceptibility Zonation of Rongga District and Surrounding Areas Using Weight of Evidence (WoE) Method

Evaluating methods for debris-flow prediction based on rainfall in an Alpine catchment

Mechanism and Stability Analysis of Deformation Failure of a Slope

Landslide Susceptibility Mapping of Urban Areas: Logistic Regression and Sensitivity Analysis applied to Quito, Ecuador

Large-Scale Landslide Susceptibility Mapping Using an Integrated Machine Learning Model: A Case Study in the Lvliang Mountains of China

Counteracting flawed landslide data in statistically based landslide susceptibility modelling for very large areas: a national-scale assessment for Austria

Rapid Terrain Assessment for Earthquake-Triggered Landslide Susceptibility With High-Resolution DEM and Critical Acceleration

Probing the Crisis of Regional Connectivity Instigated by the Natural Disasters, Mizoram, India

GIS-Based Landslide Susceptibility Mapping using Logistic Regression, Instability Index, and Support Vector Machine: Case Study of the Jingshan River, Taiwan

Temporal Variations in Landslide Distributions Following Extreme Events: Implications for Landslide Susceptibility Modeling

Analysis of Landslide Susceptibility Using Deep Neural Network

National-scale data-driven rainfall induced landslide susceptibility mapping for China by accounting for incomplete landslide data

Comparing methods for determining landslide early warning thresholds: potential use of non-triggering rainfall for locations with scarce landslide data availability

Landslides susceptibility mapping based on geospatial data and geomorphic attributes (a case study: Pacet, Mojokerto, East Java)

GIS-based comparative study of Bayes network, Hoeffding tree and logistic model tree for landslide susceptibility modeling

Derivation of earthquake-induced landslide distribution using aerial photogrammetry: the January 24, 2020, Elazig (Turkey) earthquake

Spatio-temporal variability of monsoon precipitation and their effect on precipitation triggered landslides in relation to relief in Himalayas

Model performance analysis for landslide susceptibility in cold regions using accuracy rate and fluctuation characteristics

Assessment of Submarine Landslide Susceptibility in the Sea Area of Zhoushan

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Landslide Data Research Articles

Related Topics

Articles published on Landslide Data

A New Integrated Approach for Landslide Data Balancing and Spatial Prediction Based on Generative Adversarial Networks (GAN)

Landslide Susceptibility Zonation of Rongga District and Surrounding Areas Using Weight of Evidence (WoE) Method

Evaluating methods for debris-flow prediction based on rainfall in an Alpine catchment

Mechanism and Stability Analysis of Deformation Failure of a Slope

Landslide Susceptibility Mapping of Urban Areas: Logistic Regression and Sensitivity Analysis applied to Quito, Ecuador

Large-Scale Landslide Susceptibility Mapping Using an Integrated Machine Learning Model: A Case Study in the Lvliang Mountains of China

Counteracting flawed landslide data in statistically based landslide susceptibility modelling for very large areas: a national-scale assessment for Austria

Rapid Terrain Assessment for Earthquake-Triggered Landslide Susceptibility With High-Resolution DEM and Critical Acceleration

Probing the Crisis of Regional Connectivity Instigated by the Natural Disasters, Mizoram, India

GIS-Based Landslide Susceptibility Mapping using Logistic Regression, Instability Index, and Support Vector Machine: Case Study of the Jingshan River, Taiwan

Temporal Variations in Landslide Distributions Following Extreme Events: Implications for Landslide Susceptibility Modeling

Analysis of Landslide Susceptibility Using Deep Neural Network

National-scale data-driven rainfall induced landslide susceptibility mapping for China by accounting for incomplete landslide data

Comparing methods for determining landslide early warning thresholds: potential use of non-triggering rainfall for locations with scarce landslide data availability

Landslides susceptibility mapping based on geospatial data and geomorphic attributes (a case study: Pacet, Mojokerto, East Java)

GIS-based comparative study of Bayes network, Hoeffding tree and logistic model tree for landslide susceptibility modeling

Derivation of earthquake-induced landslide distribution using aerial photogrammetry: the January 24, 2020, Elazig (Turkey) earthquake

Spatio-temporal variability of monsoon precipitation and their effect on precipitation triggered landslides in relation to relief in Himalayas

Model performance analysis for landslide susceptibility in cold regions using accuracy rate and fluctuation characteristics

Assessment of Submarine Landslide Susceptibility in the Sea Area of Zhoushan