Understanding the spatial distribution of topsoil properties in grassland ecosystems is essential for improving soil ecosystem services, quality, and erosion resilience. The availability of free, high-resolution satellite imagery and advanced data mining techniques offers new opportunities for efficient soil property assessment. This study aimed to evaluate the potential utility of multi-season PlanetScope imagery to predict soil organic carbon (SOC), pH, and calcium carbonate (CaCO3). Using random sampling, 121 topsoil samples (0–30 cm depth) were collected with an auger across grasslands, bare soil, and eroded areas within a typical grazing land use. Three data mining techniques: random forest (RF), extreme gradient boosting (XGB), and support vector machines (SVM), were applied and evaluated using a 10-fold cross-validation. The results indicated that multi-season spectral covariates considerably improved the accuracy of the target soil properties compared to single-season imagery. SVM was the most effective algorithm for predicting SOC, achieving a root mean square error (RMSE) of 0.52%, mean absolute error (MAE) of 0.24%, and R² of 0.92. RF was the best-performing algorithm for predicting soil pH (RMSE = 0.22, MAE = 0.17, and R² = 0.97) and CaCO3 (RMSE = 0.55%, MAE = 0.42%, and R² = 0.96). While XGB failed to capture the variability in soil pH, the other models generated interpretable maps that accurately represented the distribution of soil properties across different land cover categories. The green-red vegetation index (GRVI) was the most critical covariate for predicting SOC, while elevation and the topographic wetness index (TWI) were key predictors for soil pH and CaCO3, respectively. This study underscores the potential of multi-season PlanetScope imagery for accurately predicting soil properties and recommends conducting similar studies in diverse geographical settings to validate these findings and develop more generalizable models.
Read full abstract