Large-scale mapping of soil resources can be crucial and indispensable for several of the managerial applications and policy implications. With machine learning models being the most utilized modeling technique for digital soil mapping (DSM), the implementation of model-based deep learning methods for spatial soil predictions is still under scrutiny. In this study, soil continuous (pH and OC) and categorical variables (order and suborder) were predicted using deep learning–multi layer perceptron (DL-MLP) and one-dimensional convolutional neural networks (1D-CNN) for the entire state of Tamil Nadu, India. For training the deep learning models, 27,098 profile observations (0–30 cm) were extracted from the generated soil database, considering soil series as the distinctive stratum. A total of 43 SCORPAN-based environmental covariates were considered, of which 37 covariates were retained after the recursive feature elimination (RFE) process. The validation and test results obtained for each of the soil attributes for both the algorithms were most comparable with the DL-MLP algorithm depicting the attributes’ most intricate spatial organization details, compared to the 1D-CNN model. Irrespective of the algorithms and datasets, the R2 and RMSE values of the pH attribute ranged from 0.15 to 0.30 and 0.97 to 1.15, respectively. Similarly, the R2 and RMSE of the OC attribute ranged from 0.20 to 0.39 and 0.38 to 0.42, respectively. Further, the overall accuracy (OA) of the order and suborder classification ranged from 39% to 67% and 35% to 64%, respectively. The explicit quantification of the covariate importance derived from the permutation feature importance implied that both the models tried to incorporate the covariate importance with respect to the genesis of the soil attribute under study. Such approaches of the deep learning models integrating soil–environmental relationships under limited parameterization and computing costs can serve as a baseline study, emphasizing opportunities in increasing the transferability and generalizability of the model while accounting for the associated environmental dependencies.
Read full abstract