This study derived 12 topographical and hydrological factors related to landslides from a 10-m digital elevation model. Three unsupervised machine learning algorithms were employed to distinguish between the features of landslide sources and runout areas for Typhoon Morakot. Two sampling strategies were designed in this study. The first strategy involved creating a data set by pairing landslide sources and runout areas on the basis of the size of polygonal areas recorded in the inventory and then exploring the effectiveness of feature separation with k-means++ as the primary method and EM (expectation maximization) and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) as supplementary methods. Because of practical challenges associated with the inability to determine whether remote sensing results correspond to landslide sources or runout areas, the first sampling strategy was used to test the feasibility of the adopted algorithms. In the second sampling strategy, the effectiveness of feature separation was tested by dividing polygon samples into several intervals on the basis of their sizes, thereby verifying the practicability of the adopted algorithms in real-world operations. This study verified that combining unsupervised machine learning algorithms with topographic factors facilitates the effective separation of landslide sources and runout areas, thereby enhancing the quality of landslide inventories and reducing the cost of landslide inventory creation. Moreover, it indicated that meter-scale topographic data yield classification results similar to those obtained from manually created landslide inventories based on centimeter-scale stereo aerial photos. The proposed method effectively balances the cost and quality of landslide inventories.
Read full abstract