Abstract

Exploring key factors has important guidance for understanding complex anaerobic digestion (AD) systems. This study proposed a multi-layer automated machine learning framework to understand the complex interactions in AD systems and explore key factors at the environmental factor, microorganisms and system levels. The first layer of the framework identified hydraulic residence time (HRT) as the most important environmental factor, with an optimal range of 33–45 d. In the second layer of the framework, Methanocelleus (optimal relative abundance (ORA) = 3.0%) and Candidatus_Caldatribacterium (ORA = 1.7%) were found to be the key archaea and bacteria, respectively. Furthermore, the prediction of key microorganisms based on environmental factors and remaining microbial data showed the essential roles of Methanothermobacter and Acetomicrobium. The third layer for finding the optimal combination of data variables for predicting biogas production demonstrated that combined Archaea genera and environmental factors should be achieved for the most accurate prediction (root mean square error (RMSE) = 84.21). GBM had the best model performance and prediction accuracy among all the built-in models. Based on the optimal GBM model, the analysis at the system level showed that HRT was the most important variable. However the most important microorganism, Methanocelleus, within the appropriate survival range is also essential to achieve optimal biogas production. This research explores key parameters at various levels through automated machine learning techniques, which are expected to provide guidance in understanding the complex architecture of industrial and laboratory AD systems.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call