Abstract

<p>Nitrate contamination in groundwater is affected by both anthropogenic activities and natural conditions, and has become one of the most prevalent problems worldwide. In this study, several machine learning methods, including decision tree (DT), k-nearest neighbors (KNN), logistic regression (LR), support vector machine (SVM), and extreme gradient boosted trees (Xgboost), were applied to predict the risk of groundwater nitrate contamination (NO<sub>3</sub><sup>-</sup> > 50 mg L<sup>-1</sup>) in the riverside areas of the lower reaches of the Yangtze River, east China. The models used 13 hydrochemical parameters (K<sup>+</sup>, Na<sup>+</sup>, Ca<sup>2+</sup>, Mg<sup>2+</sup>, Cl<sup>-</sup>, SO<sub>4</sub><sup>2-</sup>, NH<sub>4</sub><sup>+</sup>, NO<sub>2</sub><sup>-</sup>, Fe, Mn, As, Sr, pH) and well depth as explanatory variables, and were built on a total of 1089 groundwater samples. The results showed that the hydrochemical dataset could effectively predict the risk of nitrate contamination, with accuracy ranging from 82.7% (LR) to 91.7% (SVM and Xgboost). However, accuracy alone was misleading: only the Xgboost model with a cutoff probability of 0.3 performed well at detecting contaminated samples, reaching the highest sensitivity (80.3%) and an AUC of 0.95, whereas the other models had sensitivities below 60% and thus insufficient capability to identify contaminated groundwater samples. This indicates that the ensemble learning method had a strong, robust prediction capability. In addition, the relative importance of K<sup>+</sup>, SO<sub>4</sub><sup>2-</sup>, and Cl<sup>-</sup> exceeded 0.65, indicating the dominant influence of domestic or industrial sewage in the study area due to widespread urbanization. Finally, we examined the relationships among nitrate contamination risk, land use type, the intensity of anthropogenic activities, and redox conditions, and obtained a risk map of nitrate contamination in the study area. This study demonstrates the validity of predicting the risk of groundwater nitrate contamination using machine learning tools, which supports regional groundwater management and protection.</p><p><strong>Keywords: </strong>groundwater; nitrate contamination; risk prediction; machine learning</p>
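The abstract reports both accuracy and sensitivity at a cutoff probability of 0.3. The following is a minimal sketch of why lowering the cutoff can raise sensitivity when contaminated samples are the minority class. It is not the authors' code: the function names, the labels, and the predicted probabilities are all hypothetical and chosen only for illustration.

```python
def confusion_counts(y_true, y_prob, cutoff):
    """Count TP, FP, TN, FN when a sample is flagged as contaminated
    (class 1) if its predicted probability is at least `cutoff`."""
    tp = fp = tn = fn = 0
    for label, prob in zip(y_true, y_prob):
        predicted = 1 if prob >= cutoff else 0
        if predicted == 1 and label == 1:
            tp += 1
        elif predicted == 1 and label == 0:
            fp += 1
        elif predicted == 0 and label == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def sensitivity(y_true, y_prob, cutoff):
    """Fraction of truly contaminated samples that are flagged: TP / (TP + FN)."""
    tp, _, _, fn = confusion_counts(y_true, y_prob, cutoff)
    return tp / (tp + fn)


def accuracy(y_true, y_prob, cutoff):
    """Fraction of all samples classified correctly: (TP + TN) / N."""
    tp, fp, tn, fn = confusion_counts(y_true, y_prob, cutoff)
    return (tp + tn) / (tp + fp + tn + fn)


# Hypothetical data: 1 = nitrate above 50 mg/L, 0 = below.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
# Hypothetical model output: predicted probability of contamination.
y_prob = [0.9, 0.6, 0.4, 0.35, 0.45, 0.2, 0.1, 0.05, 0.25, 0.15]

for cutoff in (0.5, 0.3):
    print(cutoff, sensitivity(y_true, y_prob, cutoff), accuracy(y_true, y_prob, cutoff))
```

In this toy data, the default cutoff of 0.5 misses two of the four contaminated samples, while a cutoff of 0.3 flags all four at the cost of one false alarm. Real results depend on the model and dataset.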
