Abstract

Assessment of reproductive toxicity is one of the important safety considerations in drug development. Thus, in the present research, the naïve Bayes (NB)-classifier method was applied to develop binary classification models. Six important molecular descriptors for reproductive toxicity were selected by the genetic algorithm. Then, 110 classification models were developed using six molecular descriptors and10 types of fingerprints with 11 different maximum diameters. Among these established models, the model based on six molecular descriptors and the SciTegic extended-connectivity fingerprints with 20 maximum diameters (LCFC_20) displayed the best prediction performance for reproductive toxicity (NB-1), which gave a 0.884 receiver operating characteristic (ROC) score and 91.8% overall prediction accuracy for the Training Set, and produced a 0.888 ROC score and 83.0% overall accuracy for the external Test Set I. In addition, for the external rat multi-generation reproductive toxicity dataset (Test Set II), the NB-1 model generated a 0.806 ROC score and 85.1% concordance. The generated prediction results indicated that the NB-1 model could give robust and reliable predictions for a reproductive toxicity potential of chemicals. Thus, the established model could be applied to filter early-stage molecules for potential reproductive adverse effects. In addition, six important molecular descriptors and new structural alerts for reproductive toxicity were identified, which could help medicinal chemists rationally guide the optimization of lead compounds and select chemicals with the best prospects of being safe and effective.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call