Modeling spectral reflectance data using machine learning algorithms presents a promising approach for estimating soil attributes. Nevertheless, a comprehensive investigation of the most effective models, parameters, wavelengths, and data acquisition techniques is essential to ensure optimal predictive accuracy. This work aimed to (a) explore the potential of the soil spectral signature obtained in different spectral bands (VIS-NIR, SWIR, and VIS-NIR-SWIR) and, by using hyperspectral imaging and non-imaging sensors, in the predictive modeling of soil attributes; and (b) analyze the accuracy of different ML models in predicting particle size and soil organic carbon (SOC) applied to the spectral signature of different spectral bands. Six soil monoliths, located in the central north region of Parana, Brazil, were collected and scanned via hyperspectral cameras (VIS-NIR camera and SWIR camera) and spectroradiometer (VIS-NIR-SWIR) in the laboratory. The spectral signature of the soils was analyzed and subsequently applied to ML models to predict particle size and SOC. Each set of data obtained by the different sensors was evaluated separately. The algorithms used were k-nearest neighbors (KNN), support vector machine (SVM), random forest (RF), linear regression (LR), artificial neural network (NN), and partial least square regression (PLSR). The most promising predictive performance was observed for the complete VIS-NIR-SWIR spectrum, followed by SWIR and VIS-NIR. Meanwhile, KNN, RF, and NN models were the most promising algorithms in estimating soil attributes for the dataset obtained from both sensors. The general mean R2 (determination coefficient) values obtained using these models, considering the different spectral bands evaluated, were around 0.99, 0.98, and 0.97 for sand prediction, and around 0.99, 0.98, and 0.96 for clay prediction. The lower performances, obtained for the datasets from both sensors, were observed for silt and SOC, with R2 results between 0.40 and 0.59 for these models. KNN demonstrated the best predictive performance. Integrating effective ML models with robust sample databases, obtained by advanced hyperspectral imaging and spectroradiometers, can enhance the accuracy and efficiency of soil attribute prediction.
Read full abstract