Abstract

Soil datasets with outliers lead to inaccurate farm-level digital soil mapping (DSM) results. Existing methods identify potential outliers in soil datasets based on expert experience or simple statistics that neglect the geographical characteristics of soil. In this paper, a novel potential outlier recognition method was developed from the perspective of geographical context. First, spatial search distance was automatically determined by the spatial distance among soil samples. Second, similarities of adjacent soil samples and the local spatial variation level were comprehensively considered to calculate outlier scores. Finally, a frequency histogram of outlier scores was generated to determine a suitable threshold for recognizing potential abnormal samples. To validate the proposed method, it was compared to Lambda and Box-Plot methods, and the ordinary kriging method was used to map five soil properties, including pH, soil organic matter, total nitrogen, available phosphorus and available potassium, in an agricultural region. Then, a synthetic study using artificially contaminated DEM data was also conducted. The comparative experiment shows that the proposed method is better able to recognize potential outliers by mining the local spatial structure, as indicated by lower mean absolute error (MAE) and root mean square error (RMSE) values. It can be concluded that consideration of local spatial autocorrelation and heterogeneity is helpful in recognizing potential outliers.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call