Machine learning methodology has recently been considered a smart and reliable way to monitor water quality parameters in aquatic environments like reservoirs and lakes. This study employs both individual and hybrid-based techniques to boost the accuracy of dissolved oxygen (DO) and chlorophyll-a (Chl-a) predictions in the Wadi Dayqah Dam located in Oman. At first, an AAQ-RINKO device (CTD+ sensor) was used to collect water quality parameters from different locations and varying depths in the reservoir. Second, the dataset is segmented into homogeneous clusters based on DO and Chl-a parameters by leveraging an optimized K-means algorithm, facilitating precise estimations. Third, ten sophisticated variational-individual data-driven models, namely generalized regression neural network (GRNN), random forest (RF), gaussian process regression (GPR), decision tree (DT), least-squares boosting (LSB), bayesian ridge (BR), support vector regression (SVR), K-nearest neighbors (KNN), multilayer perceptron (MLP), and group method of data handling (GMDH) are employed to estimate DO and Chl-a concentrations. Fourth, to improve prediction accuracy, bayesian model averaging (BMA), entropy weighted (EW), and a new enhanced clustering-based hybrid technique called Entropy-ORNESS are employed to combine model outputs. The Entropy-ORNESS method incorporates a Genetic Algorithm (GA) to determine optimal weights and then combine them with EW weights. Finally, the inclusion of bootstrapping techniques introduces a stochastic assessment of model uncertainty, resulting in a robust estimator model. In the validation phase, the Entropy-ORNESS technique outperforms the independent models among the three fusion-based methods, yielding R2 values of 0.92 and 0.89 for DO and Chl-a clusters, respectively. The proposed hybrid-based methodology demonstrates reduced uncertainty compared to single data-driven models and two combination frameworks, with uncertainty levels of 0.24% and 1.16% for cluster 1 of DO and cluster 2 of Chl-a parameters. As a highlight point, the spatial analysis of DO and Chl-a concentrations exhibit similar pattern variations across varying depths of the dam according to the comparison of field measurements with the best hybrid technique, in which DO concentration values notably decrease during warmer seasons. These findings collectively underscore the potential of the upgraded weighted-based hybrid approach to provide more accurate estimations of DO and Chl-a concentrations in dynamic aquatic environments.
Read full abstract