Abstract

The content of soil organic carbon (SOC) plays an important role in maintaining ecosystem functions, protecting soil biodiversity, and understanding carbon cycling processes. The combination of visible and near-infrared spectroscopy (VIS–NIRS) and machine learning can achieve rapid prediction of SOC content. However, it is still relatively unknown how to integrate the characteristics of various machine learning models to improve the performance of SOC prediction models. In this study, a new model for predicting SOC content based on stacking ensemble learning was proposed by using VIS–NIRS. The prediction performances of six different models including Support Vector Regression (SVR), Extreme Gradient Boosting (XGBoost), Random Forest (RF), Light Gradient Boosting Machine (LightGBM), PartialLeast-square (PLS) and Extreme Learning Machine (ELM) on SOC content under different spectral preprocessing methods were compared. The results indicated that SVR, XGBoost, and LightGBM models provide better prediction performance after first-order derivative preprocessing. After comparing the performance of various combinations of base models applied to the first layer of a stacking ensemble model, the results showed that both the combination of XGBoost, LightGBM, and SVR models and the combination of SVR, ELM, and LightGBM models achieve the best performance. The coefficient of determination (R2) of the stacking ensemble model on the test set reaches 0.84, which improves the accuracy of the model compared with the traditional single model. The stability of the stacking ensemble model was verified by applying it to datasets of different sizes, which can replace traditional machine learning models in predicting SOC content.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.