Appropriate carbon source addition can save operational costs and reduce surplus sludge yield in the wastewater treatment plant (WWTP). However, the link between carbon source and surplus sludge yield remains neglected although machine learning (ML) has become a powerful tool for WWTP, and is a challenge due to more complex multidimensional pattern recognition. Herein, weighted average ensemble strategy was conducted to assemble multiple diverse basic models to obtain better prediction capability to optimize carbon source addition (Model-1) and further control surplus sludge yield (Model-2). The ensemble models significantly outperformed all single models with MAE of 5.82g/m3, MSE of 60.59 and R2 value of 0.98 in Model-1 and MAE of 15.09g/m3, MSE of 449.01 and R2 value of 0.93 in Model-2. The optimal input feature subset was explored to reduce model complexity, indicating that the final ensemble models can predict with high precision using relatively few features with MAE of 6.41g/m3, MSE of 78.49 and R2 value of 0.97 in Model-1 and MAE of 12.82g/m3, MSE of 232.71 and R2 value of 0.95 in Model-2. Furthermore, the final models were deployed into an offline web application to facilitate their utility in real-world settings, demonstrating 47.25 % savings in carbon source addition and 15.89 % reductions in surplus sludge yield for an extra month of running. This work offers an efficient approach for the WWTP to optimize carbon source addition and provides new insights into controlling surplus sludge yield.
Read full abstract