Abstract

Water quality assessment is essential for safeguarding public health and protecting water resources. This study focused on predicting water quality, specifically the presence of total coliforms, using various machine-learning techniques. The present study utilises a publicly available dataset encompassing the geographical area of India consisting of various physical water quality parameters. Various regression techniques were applied to the dataset after appropriate pre-processing including feature selection and normalisation. The findings demonstrate that gradient boosting regression outperforms other methods, achieving high accuracy with mean absolute error (MAE) of 0.0349, mean squared error (MSE) of 0.0038, and root mean squared error (RMSE) of 0.0620. Conductivity and temperature emerged as the most influential factors in total coliform prediction, as revealed by feature importance analysis. These results contribute to water quality understanding, aiding water resource management for public health protection. By accurately predicting total coliform presence, proactive measures can be taken timely to mitigate and minimise health risks associated with microbial contamination.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.