ABSTRACT The significance of soil in agriculture and human survival cannot be overstated. Given its limited supply, improving soil properties is imperative. This study addresses this challenge by predicting five major soil properties—organic carbon, pH values, Mehlich-3 extractable calcium, Mehlich-3 extractable phosphorus, and sand content—utilizing mid-infrared absorbance measurements from the African Soil Information Service (AfSIS) dataset, covering non-desert regions of Africa. With a pressing need for a reliable and efficient model to predict African soil properties using spectral measurements, this study fills a crucial gap, addressing the scarcity of functional soil property databases in Africa. The developed model eliminates costly soil sample preparation and lengthy chemical analysis, applicable in both onsite and laboratory settings for determining soil functional properties. By employing stacked autoencoders for feature dimensionality reduction and combining discrete wavelet transform with two feature selection methods (stepwise regression and random forest) to build robust multi-layer perceptron (MLP) models, the study offers a comprehensive approach. Evaluation metrics including root mean square error (RMSE), scatter index (SI), Variance Accounted For (VAF), Nash-Sutcliffe Model Efficiency (NSE), and Correlation Coefficient (R) demonstrate the superior performance of the MLP model informed by stacked autoencoder-selected features, outperforming models informed by wavelet-transformed features and partial least square regression. This best-performing autoencoder-based model presents a valuable tool for soil scientists tasked with modeling African soil properties.
Read full abstract