Abstract

Gully erosion is a severe environmental problem that threatens agriculture, human safety, habitats, infrastructure, and soil integrity. Because the performance of machine learning models varies across environmental hazards, selecting an appropriate model is vital for accurate Gully Erosion Susceptibility Mapping (GESM). This study compared two machine learning techniques, Random Forest (RF) and Extreme Gradient Boosting (XGBoost), to develop a precise GESM for the Silabati watershed (India). The analysis incorporated 24 controlling factors and a dataset of 460 sample points, with equal representation of gullies and non-gullies. Variance Inflation Factor (VIF) and Information Gain Ratio (IGR) techniques were applied to test for multicollinearity among the controlling factors. Lithology, elevation, distance from the road, land use/land cover (LULC), geomorphology, rainfall, drainage density, and coarse fragments emerged as crucial factors in determining gully erosion susceptibility. Statistical measures, including the Kappa index, Root Mean Square Error (RMSE), Accuracy (ACC), Mean Absolute Error (MAE), Coefficient of Determination (R²), and Receiver Operating Characteristic (ROC), were used to evaluate the RF and XGBoost models on training and testing data. Both models performed strongly, with XGBoost and RF achieving ROC values of 84.1% and 83.2%, respectively. The quantile classification method was used to divide each susceptibility map into five classes: very high (VH), high (H), moderate (M), low (L), and very low (VL). Very high susceptibility areas covered 4.10% of the watershed in the XGBoost model and 4.61% in the RF model. Of the 26 sub-watersheds, five were identified as high priority for sustainable management. The present study therefore provides accurate identification of gully erosion using advanced models, offering policymakers valuable insights for implementing proper management.
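Several of the evaluation measures named above can be illustrated with a minimal sketch. The code below is not the study's implementation: the function name `evaluate`, the 0.5 classification threshold, the choice to compute MAE and RMSE on predicted probabilities, and the toy data are all assumptions made for illustration.

```python
import math

def evaluate(y_true, y_prob, threshold=0.5):
    """Return ACC, Kappa, MAE and RMSE for binary labels and predicted probabilities."""
    n = len(y_true)
    # Assumed threshold: probabilities at or above it are classed as gully (1).
    y_pred = [1 if p >= threshold else 0 for p in y_prob]

    # Accuracy: share of points whose predicted class matches the label.
    acc = sum(t == p for t, p in zip(y_true, y_pred)) / n

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_true1 = sum(y_true) / n
    p_pred1 = sum(y_pred) / n
    p_chance = p_true1 * p_pred1 + (1 - p_true1) * (1 - p_pred1)
    kappa = (acc - p_chance) / (1 - p_chance) if p_chance < 1 else 0.0

    # MAE and RMSE between labels and predicted probabilities (an assumption).
    errors = [t - p for t, p in zip(y_true, y_prob)]
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    return {"ACC": acc, "Kappa": kappa, "MAE": mae, "RMSE": rmse}

# Hypothetical toy data: 1 = gully, 0 = non-gully, balanced between classes.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_prob = [0.9, 0.8, 0.6, 0.4, 0.1, 0.3, 0.7, 0.2]
metrics = evaluate(y_true, y_prob)
print(metrics)
```

With balanced classes, as in a sample of equal gully and non-gully points, a model that guesses at random has a chance agreement of 0.5, so Kappa shows how far the accuracy rises above that baseline.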
