Abstract

The apparent second-order rate constant with hexavalent ferrate (Fe(VI)) (kFe(VI)) is a key indicator to evaluate the removal efficiency of a molecule by Fe(VI) oxidation. kFe(VI) is often determined by experiment, but such measurements can hardly catch up with the rapid growth of organic compounds (OCs). To address this issue, in this study, a total of 437 experimental second-order kFe(VI) rate constants at a range of conditions (pH and temperature) were used to train four machine learning (ML) algorithms (lasso regression (LR), ridge regression (RR), extreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM)). Using the Morgan fingerprint (MF)) of a range of organic compounds (OCs) as the input, the performance of the four algorithms was comprehensively compared with respect to the coefficient of determination (R2) and root-mean-square error (RMSE). It is shown that the RR, XGBoost, and LightGBM models displayed generally acceptable performance kFe(VI) (R2test > 0.7). In addition, the shapely additive explanation (SHAP) and feature importance methods were employed to interpret the XGBoost/LightGBM and RR models, respectively. The results showed that the XGBoost/LightGBM and RR models suggestd pH as the most important predictor and the tree-based models elucidate how electron-donating and electron-withdrawing groups influence the reactivity of the Fe(VI) species. In addition, the RR model share eight common features, including pH, with the two tree-based models. This work provides a fast and acceptable method for predicting kFe(VI) values and can help researchers better understand the degradation behavior of OCs by Fe(VI) oxidation from the perspective of molecular structure.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.