Abstract
Structure–activity relationship (SAR) models can be used to predict the biological activity of potential developmental toxicants whose adverse effects include death, structural abnormalities, altered growth and functional deficiencies in the developing organism. Physico-chemical descriptors of spatial, electronic and lipophilic properties were used to derive SAR models by two modeling approaches, logistic regression and Classification and Regression Tree (CART), using a new developmental database of 293 chemicals (FDA/TERIS). Both single models and ensembles of models (termed bagging) were derived to predict toxicity. Assessment of the empirical distributions of the prediction measures was performed by repeated random partitioning of the data set. Results showed that both the decision tree and logistic regression derived developmental SAR models exhibited modest prediction accuracy. Bagging tended to enhance the prediction accuracy and reduced the variability of prediction measures compared to the single model for CART-based models but not consistently for logistic-based models. Prediction accuracy of single logistic-based models was higher than single CART-based models but bagged CART-based models were more predictive. Descriptor selection in SAR for the understanding of the developmental mechanism was highly dependent on the modeling approach. Although prediction accuracy was similar in the two modeling approaches, there was inconsistency in the model descriptors.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have