River water quality is of utmost importance because the river is not only one of the key water resources but also a natural habitat serving its surrounding environment. In a bid to address whether it has a qualified quality, various analytics are required to be considered, but it is challenging to measure all of them frequently along a river reach. Therefore, estimating water quality index (WQI) incorporating several weighted analytics is a useful approach to assess water quality in rivers. This study explored applications of ten machine learning (ML) models to estimate WQI for the Southern Bug River, which is the second-longest river in Ukraine. The ML methods considered in this study include artificial neural networks (ANNs), Support Vector Regressor (SVR), Extreme Learning Machine, Decision Tree Regressor, random forest, AdaBoost (AB), Gradient Boosting Regressor, XGBoost Regressor (XGBR), Gaussian process (GP), and K-nearest neighbors (KNN). Each data measurement consists of nine analytics (NH4, BOD5, suspended solids, DO, NO3, NO2, SO4, PO4, Cl), while the quantity of data is more than 2700 data points. The results indicated that all ML models demonstrate satisfactory performance in predicting WQI. However, GP outperformed the other models, followed by XGBR, SVR, and KNN. Furthermore, ANN and AB demonstrated relatively weaker performance. Moreover, a reliability assessment conducted on both training and testing datasets also confirmed the results of the comparative analysis. Overall, the results enhance the assertion that ML models can sufficiently predict WQI, thereby enhancing water quality management.
Read full abstract