This study used hyperspectral remote sensing to rapidly, economically, and non-destructively determine the soil iron oxide content of the Dinosaur Valley annular tectonic region of Lufeng, Yunnan Province. The laboratory determined the iron oxide content and original spectral reflectance (OR) in 138 surface soil samples. We first subjected the OR data to Savizky-Golay smoothing, followed by four spectral transformations-continuum removal reflectance, reciprocal logarithm reflectance, standard normal variate reflectance, and first-order differential reflectance-which improved the signal-to-noise ratio of the spectral curves and highlighted the spectral features. Then, we combined the correlation coefficient method (CC), competitive adaptive reweighting algorithm, and Boruta algorithm to screen out the characteristic wavelength. From this, we constructed the linear partial least squares regression model, nonlinear random forest, and XGBoost machine learning algorithms. The results show that the CC-Boruta method can effectively remove any noise and irrelevant information to improve the model's accuracy and stability. The XGBoost nonlinear machine learning algorithm model better captures the complex nonlinear relationship between the spectra and iron oxide content, thus improving its accuracy. This provides a relevant reference for the rapid and accurate inversion of iron oxide content in soil using hyperspectral data.
Read full abstract