Abstract

Landslides, the second largest geological hazard after earthquakes, result in significant loss of life and property. Extracting landslide information quickly and accurately is the basis of landslide disaster prevention. Fengjie County, Chongqing, China, is a typical landslide-prone area in the Three Gorges Reservoir Area. In this study, we newly integrate Shapley Additive Explanation (SHAP) and Optuna (OPT) hyperparameter tuning into four basic machine learning algorithms: Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Additive Boosting (AdaBoost). We construct four new models (SHAP-OPT-GBDT, SHAP-OPT-XGBoost, SHAP-OPT-LightGBM, and SHAP-OPT-AdaBoost) and apply the four new models to landslide extraction for the first time. Firstly, high-resolution remote sensing images were preprocessed, landslide and non-landslide samples were constructed, and an initial feature set with 48 features was built. Secondly, SHAP was used to select features with significant contributions, and the important features were selected. Finally, Optuna, the Bayesian optimization technique, was utilized to automatically select the basic models’ best hyperparameters. The experimental results show that the accuracy (ACC) of these four SHAP-OPT models was above 92% and the training time was less than 1.3 s using mediocre computational hardware. Furthermore, SHAP-OPT-XGBoost achieved the highest accuracy (96.26%). Landslide distribution information in Fengjie County from 2013 to 2020 can be extracted by SHAP-OPT-XGBoost accurately and quickly.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call