Enhancing water quality prediction for fluctuating missing data scenarios: A dynamic Bayesian network-based processing system to monitor cyanobacteria proliferation

M Pazo, S Gerassis, M Araújo, I Margarida Antunes, X Rigueira

doi:10.1016/j.scitotenv.2024.172340

Open Access

https://doi.org/10.1016/j.scitotenv.2024.172340

Abstract

Tackling the impact of missing data in water management is crucial to ensuring the reliability of scientific research that informs decision-making processes in public health. The goal of this study is to ascertain the root causes associated with cyanobacteria proliferation under major missing data scenarios. For this purpose, a dynamic missing data management methodology is proposed that uses Bayesian machine learning for accurate surface water quality prediction in a river of the Limia basin (Spain). The methodology entails a sequence of analytical steps: data pre-processing, followed by the selection of a reliable dynamic Bayesian missing value prediction system, and finally a supervised analysis of the behavioral patterns exhibited by cyanobacteria. In total, 2,118,844 data points were used, of which 205,316 (9.69 %) were identified as missing values. Machine learning testing showed iterative structural expectation maximization (SEM) to be the best-performing algorithm, outperforming the dynamic imputation (DI) and entropy-based dynamic imputation (EBDI) methods and in some cases improving imputation accuracy by approximately 50 % in R², RMSE, NRMSE, and logarithmic loss values. These findings can change how water quality data are processed and studied, thus opening the door to more reliable water management strategies that better inform public health decisions.
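
As an illustration of the figures quoted in the abstract, the following minimal Python sketch reproduces the reported missing-data share and shows one common way to score imputed values with R², RMSE, and NRMSE. It is not the authors' code: the function names are hypothetical, the NRMSE is normalised by the range of the true values (an assumption, since the abstract does not state the normalisation used), and logarithmic loss is omitted because it depends on the discretised states of the Bayesian network.

```python
import math


def missing_fraction(n_total, n_missing):
    """Share of missing values among all data points."""
    return n_missing / n_total


def imputation_scores(true_vals, imputed_vals):
    """Score imputed values against held-out true values.

    Returns R^2, RMSE and NRMSE. NRMSE is normalised by the range
    of the true values (an assumption made for this sketch).
    """
    n = len(true_vals)
    mean_true = sum(true_vals) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(true_vals, imputed_vals))
    ss_tot = sum((t - mean_true) ** 2 for t in true_vals)
    rmse = math.sqrt(ss_res / n)
    value_range = max(true_vals) - min(true_vals)
    return {
        "r2": 1 - ss_res / ss_tot,
        "rmse": rmse,
        "nrmse": rmse / value_range,
    }


# Counts reported in the abstract.
print(f"{missing_fraction(2_118_844, 205_316):.2%}")  # 9.69%
```

In practice such scores would be computed by masking known observations, imputing them with each candidate method (SEM, DI, EBDI), and comparing the results on the masked entries.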

Journal: Science of The Total Environment
Publication Date: Apr 10, 2024
License type: CC BY
