The technology for estimating soil properties using visible and near-infrared spectroscopy has been maturing, with corresponding advances and breakthroughs in deep learning models. In this study, based on the large soil spectral library LUCAS, we explore the potential of encoder-decoder structures to improve convolutional neural network regression predictions. By introducing an encoder-decoder structure into the feature channels of a six-layer CNN model (TRNN model), we significantly enhanced the performance of shallow CNN models and successfully carried out regression predictions for seven soil properties. We employed IntegratedGradients, DeepLift, GradientShap, and DeepLiftShap methods to interpret the output of the TRNN model. Our TRNN model, built on raw spectra, demonstrated high accuracy in predicting multiple soil properties, outperforming residual architectures, LSTMs, various CNN architectures, and other traditional machine learning methods proposed in previous studies. We also investigated the impact of multi-task output structures (TRNN 1-M and TRNN M−M) and single-task output structures (TRNN 1-1) on model performance. For the TRNN model with an encoder-decoder structure, multi-task output structures resulted in a reduction in performance. The TRNN showed outstanding results in regression analysis of the seven soil properties selected in this study (cation exchange capacity, organic carbon content, calcium carbonate content, pH, clay content, silt content, and sand content), with R2 values exceeding 0.93 for all seven properties. Different soil characteristics correspond to different wavelengths, with multiple characteristic peaks commonly observed. This research convincingly demonstrates the enormous potential of combining large model architectures with traditional deep learning approaches for predicting soil properties, which could significantly advance precision agriculture.
Read full abstract