Abstract

• Assessment of heavy metal pollution in soil base on hyperspectral image. • Estimating the soil heavy metal content by constructing ensemble learning model. • The estimation accuracy of heavy metal content in soil is improved. • Mapping the distribution trend of heavy metals in farmland soil. • The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Heavy metal pollution poses a huge challenge to the soil environment. With the increasing pollution level, the traditional monitoring methods cannot quickly obtain information on large-area pollution. Therefore, a large-scale mapping method with high precision is urgently needed to effectively control heavy metal pollution. This study explored a method for mapping soil heavy metal concentrations through hyperspectral images. On this basis, a new Stacked AdaBoost ensemble learning algorithm was constructed to construct the inversion model of soil heavy metal contents. The characteristic spectral bands of heavy metals were extracted as model input variables using Pearson’s correlation coefficient and successive projections algorithm. With three sets of heavy metal content data, the prediction accuracy and mapping outcomes of various machine learning methods were compared. Furthermore, the potential sources of heavy metal pollution in the study area were analyzed based on the Moran’s index. The results showed that the Stacked AdaBoost model was relatively stable with higher accuracy than traditional machine learning models. For Cr, Cu, and As, the determination coefficients ( R 2 ) of the verification set were 0.66, 0.61, and 0.74, respectively. Afterward, the results of this model were used to map the heavy metal concentration over the study area. The mapping results suggested that the heavy metal conditions of soils in the Ganhetan area were caused by nature and human activities. The As pollution in agricultural soils was the most serious, with an exceedance rate of 38.66%. Industrial areas were potential sources of soil heavy metal pollution in the study area. In summary, the Stacked AdaBoost ensemble learning model provides detailed and reliable data for agricultural ecological protection and industrial pollution control, allowing the effective management of heavy metal pollution sources.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call