Abstract
Wastewater treatment plants (WWTPs) are complex systems presenting stochastic, non-linear, and non-stationary behavior, which makes their operational management very challenging. In this context, data collected from distributed sources across the plant play a central role in the optimized operation and control of WWTPs. However, even when available, the use of collected data is far from trivial due to the coexistence of asynchronous measurements, data with different granularity, measurements of different quality (precision, accuracy), multimodal sources (sensors, spectra, images, hyphenated instrumentation), among other aspects related to the data life cycle. Such heterogeneity in process data characteristics hinders the application of most off-the-shelf data analytics methods. Flexible solutions able to cope with the complexity of systems and of the data they generate are therefore necessary to overcome these limitations and enable an effective analysis and operation of WWTPs. In this article, data-fusion approaches for handling multiple heterogeneous sources of process data are developed and comparatively tested. Priority is given to solutions that can flexibly be adapted to different specific operational contexts. The methodologies are tested on an industrial case study (WWTP), where the concentration of a toxin in the effluent stream is to be predicted from available heterogeneous data. Single- and multi-source modeling approaches are contemplated and a nested cross-validation method was developed to handle the time-series nature of the models. Bayesian fusion synergistically combines data from different sources considering their uncertainty, standing out among the methodologies tested as offering a good balance in terms of accuracy (RMSEP = 1.34), stability (prequential IQR = 0.034), and flexibility (to accommodate missing and new sources).
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.