Abstract

AbstractIn order to accurately predict soil properties, various machine learning (ML) approaches and hybrid models constructed by integrating ML into regression kriging framework were used to predict and map arable land topsoil pH in Henan province, central China. Random forest (RF), cubist (Cu), support vector machine, artificial neural network, multiple linear regression, classification and regression trees (CART) and their hybrid models were compared for pH accuracy prediction. Among all standalone ML models, RF had the best predictive performance, in terms of the metrics employed in this study, followed by Cu, and CART was the worst. Compared with their ML counterparts, hybrid models could improve the accuracy of topsoil pH prediction to various extents. The accuracy improvement of the hybrid models constructed based on the simple ML was much greater than that based on the complex ensemble ML. Except for artificial neural network kriging , there was no significant difference between different hybrid models in the predicted results of topsoil pH. The outputs from the best predictive models showed that weak acidic soils and weak alkaline soils were the dominant arable soils in the study region, accounting for more than 30% and more than 50% of the total arable land area respectively, the topsoil pH of arable land in the north of the study area is generally higher than that in the south. Boruta variable selection revealed that altitude, climatic covariates closely related to soil moisture availability and some soil properties were the most critical factors affecting and controlling the topsoil pH of arable land.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call