Abstract

Evaluating the quality of groundwater in a specific aquifer could be a costly and time-consuming procedure. An attempt was made in this research to predict various parameters of water quality called Fe, Cl, SO4, pH and total hardness (as CaCO3) by measuring properties of total dissolved solids (TDSs) and electrical conductivity (EC). This was reached by establishing relations between groundwater quality parameters, TDS and EC, using various machine learning (ML) models, such as linear regression (LR), tree regression (TR), Gaussian process regression (GPR), support vector machine (SVM), and ensembles of regression trees (ER). Data for these variables were gathered from five unrelated groundwater quality studies. The findings showed that the TR, GPR, and ER models have satisfactory performance compared to that of LR and SVM with respect to different assessment criteria. The ER model attained higher accuracy in terms of R2 in TDS 0.92, Fe 0.89, Cl 0.86, CaCO3 0.87, SO4 0.87, and pH 0.86, while the GPR model attained an EC 0.98 compared to all developed models. Moreover, comparisons among the different developed models were performed using accuracy improvement (AI), improvement in RMSE (PRMSE), and improvement in PMAE to determine a higher accuracy model for predicting target properties. Generally, the comparison of several data-driven regression methods indicated that the boosted ensemble of the regression tree model offered better accuracy in predicting water quality parameters. Sensitivity analysis of each parameter illustrates that CaCO3 is most influential in determining TDS and EC. These results could have a significant impact on the future of groundwater quality assessments.

Highlights

  • The rising need for clean drinking water draws awareness for the management of groundwater quality

  • R2, root mean square error (RMSE), mean square error (MSE), and mean absolute error (MAE) are significant performance measurements, and consuming time in training is considered a significant metric to validate the quality of the model

  • The outcomes of this study can be summarized as follows: In terms of accuracy, which is represented via R2, each tree regression (TR), Gaussian process regression (GPR), and ensembles of regression trees (ER) has satisfactory performance

Read more

Summary

Introduction

The rising need for clean drinking water draws awareness for the management of groundwater quality. The quality of a specific groundwater resource is connected to it as a natural constituent, for example, the several microorganisms, sediments, and chemical compounds that exist in it. Chloride (Cl), for example, may lead to changes in savor; it is not poisonous to humans. It might cause corrosion of metals in the well and pipe if it occurs at concentrations higher than 250 mg/L. Reliant on the size of a specific resource and the site of wells, several samples may be required to establish a representative quality evaluation. After the sampling process is completed, there is frequent requirement for off-site laboratory analysis to determine the concentration of several ions. Measures of water quality, such as pH, total dissolved solids (TDSs), and electrical conductivity (EC), could be measured on-site using digital meters

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call