Abstract
Accurately and quickly estimating the soil organic carbon (SOC) content is crucial in the monitoring of global carbon. Environmental variables play a significant role in improving the accuracy of the SOC content estimation model. This study focuses on modeling methodologies and environmental variables, which significantly influence the SOC content estimation model. The modeling methods used in this research comprise multiple linear regression (MLR), partial least squares regression (PLSR), random forest, and support vector machines (SVM). The analyzed environmental variables include terrain, climate, soil, and vegetation cover factors. The original spectral reflectance (OSR) of Landsat 5 TM images and the spectral reflectivity after the derivative processing were combined with the above environmental variables to estimate SOC content. The results showed that: (1) The SOC content can be efficiently estimated using the OSR of Landsat 5 TM, however, the derived processing method cannot significantly improve the estimation accuracy. (2) Environmental variables can effectively improve the accuracy of SOC content estimation, with climate and soil factors producing the most significant improvements. (3) Machine learning modeling methods provide better estimation accuracy than MLR and PLSR, especially the SVM model which has the highest accuracy. According to our observations, the best estimation model in the study area was the “OSR + SVM” model (R2 = 0.9590, RMSE = 13.9887, MAE = 10.8075), which considered four environmental factors. This study highlights the significance of environmental variables in monitoring SOC content, offering insights for more precise future SOC assessments. It also provides crucial data support for soil health monitoring and sustainable agricultural development in the study area.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have