Abstract

AbstractMany environmental and industrial chemicals are reported to have androgenic or antiandrogenic activities. These androgenic chemicals may act as hormones and have the potential to disrupt the endocrine systems of wildlife and humans. In this study, the probabilistic neural network (PNN), support vector machine (SVM), and learning vector quantization (LVQ), three types of machine learning, were used to develop binary classification models to predict androgenicity directly from the organic compounds' molecular structures which were represented by only eleven numerical descriptors. The PNN model acquired the best overall classification rate of 86.67% for prediction data set, with Matthews Correlation Coefficient of 0.64, and the LVQ model gave the lowest false negative rate of 0.00%, which will tend to give relatively high priority during toxicology evaluation. In addition, a consensus model was produced that integrated all three of the basic model types. Compared with the individual models, this consensus model correctly predicted the androgenicity of 86.67% of the prediction set compounds, with false negative rate of 0.00% and the highest Matthews Correlation Coefficient of 0.65. The obtained results indicate that the proposed classification models could provide a feasible and practical tool for the rapid screening of potential androgens.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.