Abstract
Acquiring soil nutrient content quickly and accurately through remote sensing is the key to advance precision agriculture. The development of deep learning has provided new technical means for soil hyperspectral modeling. However, the problem of poor interpretability of deep learning models limits its development. Although SHapley Additive exPlanations (SHAP) values based on game theory have been successfully applied to the interpretation of deep learning soil spectral modeling, whether they can accurately explain the differences in deep learning model accuracy remains to be verified. Based on this, we explored whether SHAP values can accurately explain the differences in convolutional neural network (CNN) modeling accuracy. We collected soil samples from agricultural land in the Liangshui River Basin in the southern mountainous and hilly areas of China, and measured the soil total nitrogen (STN) content and soil spectral data in the laboratory. We compared the effects of full-spectrum and feature-spectrum on the accuracy of deep learning models, and obtained the contribution of wavelengths in the CNN modeling process by calculating SHAP values. The results showed that combining different spectral pre-processing methods can play their respective advantages and help improve modeling accuracy. Among them, the CNN model obtained the highest prediction accuracy under the first-derivative Savitzky-Golay smoothing combination standard normal variate (SG1-SNV) spectral pre-processing in full-spectrum modeling. Compared with the feature-spectrum selected for modeling by Mutual information (MI) and competitive adaptive reweighted sampling (CARS), the CNN model achieved higher accuracy in most pre-processed spectra in full-spectrum modeling, and SHAP values accurately explained this reason. This is because the contribution is usually higher at most wavelengths with a high correlation with STN content. The feature-spectrum selected by CARS is more widely distributed but lacks continuity, and some wavelengths with high correlation and high contribution will also be missed. Meanwhile, some wavelengths with low correlation also have high contributions, which are usually not involved in the feature spectrum modeling of MI, thus affecting the modeling accuracy. Therefore, the deep learning model is more suitable for full-spectrum modeling due to its strong feature extraction and self-learning capabilities, and SHAP can obtain the wavelength contribution of the CNN model in soil spectral modeling, and then explain the differences in modeling accuracy. This study further proves the interpretability of deep learning, provides an important basis for the application of deep learning in soil hyperspectral modeling.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.