Abstract

The soil adsorption coefficient (Koc) is an environmental fate parameter that is essential for environmental risk assessment. However, obtaining Koc requires a significant amount of time and enormous expenditure. Thus, it is necessary to efficiently estimate Koc in the early stages of a chemical's development. In this study, a quantitative structure-property relationship (QSPR) model was developed using calculated physicochemical properties and molecular descriptors with the OPEn structure-activity/property Relationship App (OPERA) and Mordred software using the largest available Koc dataset. Specifically, we compared the accuracies of the model using the light gradient boosted machine (LightGBM), a gradient boosting decision tree (GBDT) algorithm, with those of previous models. The experimental results suggested the potential to develop a QSPR model that will produce highly accurate Koc values using molecular descriptors and physicochemical properties. Unlike previous studies, the use of a combination of LightGBM, OPERA and Mordred enables the prediction of Koc for many chemicals with high accuracy. In this study, OPERA was used to calculate the physicochemical properties, and Mordred was used to calculate molecular descriptors. The wide range of chemicals covered by OPERA and Mordred enables the analysis of a diverse range of chemical compounds. We also report a method to tune the LightBGM program. The use of fast-processing software, such as LightGBM, enables parameter tuning of a method required to obtain best performance. Our research represents one of the few studies in the field of environmental chemistry to use LightGBM. Using physicochemical properties as well as molecular descriptors, we could develop highly accurate Koc prediction models when compared to prior studies. In addition, our QSPR models may be useful for preliminary environmental risk assessment without incurring significant costs during the early chemical developmental stage.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call