Abstract

The heavy metal pollution in arable soil poses a significant threat to human health. Thus, it is of great significance to investigate the contamination of heavy metal elements in the soil. As the soil polluted by heavy metal is sensitive to spectral reflectance, thus the hyperspectral remote sensing technology could be a valuable tool for retrieving heavy metal components in the soil. This study, taking chromium (Cr) concentration as an example, proposes an optimal model for estimating heavy metal components in the soil by comprehensively taking account of the spectral pretreatment, dimensionality reduction with optimal parameters, and hyperspectral model. To this end, both the linear model, i.e., partial least squares regression (PLSR), and the nonlinear model, i.e., the gradient boosting decision tree (GBDT), are applied in this study. It is found in the study area, the Savitzky-Golay (SG) method can be regarded as an excellent spectral pretreatment for the hyperspectral data regardless of the applied model. By contrast, the dimensionality reduction in terms of the Principal Component Analysis (PCA) is closely related to hyperspectral model: the optimal principal components (PCs) in the estimation of Cr concentration are the first 9 PCs for the GBDT (nonlinear model), while that for the PLSR (linear model) become the first 8 PCs. Moreover, the examination of hyperspectral model shows the GBDT model has slightly better performance than the PLSR model for the Cr concentration estimation under most conditions. Finally, when the spectral pretreatment, dimensionality reduction, and hyperspectral model are fully considered, the best retrieval model for the Cr concentration in the study area is the SG-PCA-GBDT model. Numeric measures of model accuracy show the proposed model has a determination coefficient of 0.80 and a residual prediction deviation of 2.04, which provides a potentially new method for estimating Cr concentration in the polluted soil.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call