ABSTRACT Soil depth is essential for eco-hydrological modeling, carbon storage estimation, and land evaluation, yet its spatial variation is often poorly understood and rarely mapped, particularly in complex landscapes with limited sample sizes. Digital soil mapping, employing various machine learning methods, has become crucial in predicting and mapping soil attributes. This study investigates soil depth distribution in Rajasthan and Gujarat, India, using machine learning techniques. A total of 37 environmental variables were examined to identify factors influencing soil depth, utilizing data from 3206 soil profiles. Five machine learning algorithms – Random Forest (RF), Cubist, Support Vector Machines (SVM), extreme gradient boosting (XGBoost), and Partial Least Squares Regression (PLSR) – were evaluated. MODIS, WorldClim, and Digital Elevation Model were used as environmental covariates. Among these, channel network base level, and valley depth were found to be the most significant factors in soil depth modeling. The RF model produced the best results, with an R2 of 0.53, RMSE of 25.19 cm, and MAE of 16.04 cm, outperforming Cubist and other models. Predicted soil depths ranged from 23 to 197 cm in Rajasthan and from 16 to 182 cm in Gujarat. The spatial pattern of soil depth closely matched expectations for the studied regions, demonstrating effective model and variable selection. However, uncertainty assessment revealed significant uncertainty in 60% of the study area due to low soil profile points. This study presents a promising approach to predict soil depth via digital mapping, aiding regional land use planning and potentially informing national and international land resource management strategies.
Read full abstract