Abstract
Due to the intrinsic complexity of wastewater treatment plant (WWTP) processes, it is always challenging to respond promptly and appropriately to the dynamic process conditions in order to ensure the quality of the effluent, especially when operational cost is a major concern. Machine Learning (ML) methods have therefore been used to model WWTP processes in order to avoid various shortcomings of conventional mechanistic models. However, to the best of the authors' knowledge, no ML applications have focused on investigating how operational factors can affect effluent quality. Additionally, the time lags between process steps have always been neglected, making it difficult to explain the relationships between operational factors and effluent quality. Therefore, this paper presents a novel ML-based framework designed to improve effluent quality control in WWTPs by clarifying the relationships between operational variables and effluent parameters. The framework consists of Random Forest (RF) models, Deep Neural Network (DNN) models, Variable Importance Measure (VIM) analyses, and Partial Dependence Plot (PDP) analyses, and uses a novel approach to account for the impact of time lags between processes. Details of the framework are provided along with a demonstration of its practical applicability based on a case study of the Umeå WWTP in Sweden involving a large number of samples (105763) representing the full scale of the plant's operations. Two effluent parameters, Total Suspended Solids in effluent (TSSe) and Phosphate in effluent (PO4e), and thirty-two operational variables are studied. RF models are developed, validated using DNN models as references, and shown to be suitable for VIM and PDP analyses. VIM identifies the variables that most strongly influence TSSe and PO4e, while PDP elucidates their specific effects on TSSe and PO4e. The major findings are: (1) Influent temperature is the most influential variable for both TSSe and PO4e, but it affects them in different ways; (2) PO4e depends strongly on the TSS in aeration basins – higher TSS concentrations in aeration basins generally promote PO4 removal, but excess TSS can have negative effects; (3) In general, the impact of TSS in aeration basins on TSSe and PO4e increases with the distances of the basin from the merging outlet, so more attention should be paid to the TSS concentration in the third or fourth aeration basins than the first and second ones; (4) Returning excessive amounts of sludge through the second return sludge pipe should be avoided because of its adverse impact on TSSe removal. These results could support the development of more advanced control strategies to increase control precision and reduce running costs in the Umeå WWTP and other similarly configured WWTPs. The framework could also be applied to other parameters in WWTPs and industrial processes in general if sufficient high-resolution data are available.
Highlights
Wastewater treatment plants (WWTPs) are complex, nonlinear systems with high fluctuations in flow rate, pollutant load, chemical environment, and hydraulic conditions
This paper presents a detailed description of the framework and demonstrates its applicability in engineering scenarios through a case study on the Umeå WWTP in Sweden involving a large amount of data (105,763 samples) representing the full scale of the plant's operations
Xs is the variable for which the partial dependence function is plotted and Xc are the other variables used as inputs in the Machine Learning (ML) model: 1 Here, an R2 value larger than 0.85 was considered ‘good enough’
Summary
Wastewater treatment plants (WWTPs) are complex, nonlinear systems with high fluctuations in flow rate, pollutant load, chemical environment, and hydraulic conditions. Due to these complexities and uncertainties, modeling WWTP processes is challenging (Borzooei et al, 2019; Rout et al, 2021; Vučić et al, 2021). An important advantage of ML models is that they reflect real reaction/process situations rather than mechanisms formulated in advance based on fundamental principles They are robust and comprehensive, which is important because many mechanisms involved in wastewater treatment remain unclear (Chan and Huang, 2003; Erdirencelebi and Yalpir, 2011; Faruk, 2010; Lee et al, 2002; Nadiri et al, 2018). ML modeling is widely used as an alternative to mechanistic modeling of WWTPs (Cao and Yang, 2020; Guo et al, 2015; Liu et al, 2020; Shi and Xu, 2018; Singh et al, 2010; Verma et al, 2013)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.