Soil Organic Carbon (SOC) fixation plays a vital role in reducing greenhouse gas emissions and the formation of soil aggregates is recognized as a crucial process in SOC fixation. In the current study, we characterized spectral information on SOC and aggregate Organic carbon (OC) components by using a rapid and robust method. Soil samples were collected from different layers to obtain the SOC and aggregate OC components to record corresponding spectra. The Continuous Wavelet Transform (CWT) spectra were then combined with various characteristic wavelength selection methods to build Multivariate Linear Regression (MLR), BP neural network (BP), and Random Forest (RF) models. Finally, the optimal combination of characteristic wavelengths based on the best-performing model for each index was determined. The results demonstrated that the model's accuracy, which was built using the set of characteristic wavelengths of the best model for all metrics, was higher than that of the best model for a single metric. This was due to the correlation between SOC and aggregate OC components. The best set of characteristic wavelengths were those selected by the Sequential Backward Selection method with wavelet decomposition scales of 23, 26, and 27, and those selected by the Minimum Redundancy Maximum Relevance method with wavelet decomposition scale 27. The RF model for Free silt and clay particles organic carbon (FSP-OC) showed the highest accuracy among the four SOC prediction models. In addition, it was discovered that as the wavelet decomposition scales become more similar, the extracted characteristic wavelengths also become more similar, resulting in similar model predictions. In summary, the accuracy of the predictive model for SOC and its constituents can be demonstrably improved by reasonable combination of characteristic wavelengths selection method and wavelet decomposition scale.
Read full abstract