Abstract
For the first time, a global regression quantitative structure-toxicity/activity relationship (QSTR/QSAR) model was developed for the toxicity of a large data set including 1236 chemicals towards Vibrio fischeri, by using random forest (RF) regression algorithm. The optimal RF model with RF parameters of mtry = 3, ntree = 150 and nodesize = 5 was based on 13 molecular descriptors. It can achieve accurate prediction for the toxicity of 99.1% of 1236 chemicals, and yield coefficients of determination R2 of 0.893 for 930 log(Mw/IBC50) in the training set, 0.723 for 306 log(Mw/IBC50) in the test se, and 0.865 for 1236 toxicity log(Mw/IBC50) in the total set. The optimal RF global model proposed in this work is comparable to other published local QSTR models on small datasets of the toxicity to Vibrio fischeri.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Archives of environmental contamination and toxicology
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.