Background: Prediction of toxicity of imidazo[4,5-b]pyridine derivatives is carried out using GA-MLR and BPANN methods. Objective: A quantitative structure-property relationship (QSPR) was determined based on methods, including genetic algorithm-multiple linear regression (GA-MLR) and backpropagation artificial neural network (BP-ANN). These methods were employed for modeling and predicting the anticancer potency of imidazo[4,5-b]pyridine derivatives. Materials and Methods: A dataset of imidazo[4,5-b]pyridine derivatives was randomly divided into two groups, training and test sets consisting of 75% and 25% of data points, respectively. The optimized conformation of compounds was obtained using the DFT-B3LYP method and 6-31G* basis sets level with Gaussian 09 software. A large number of molecular descriptors were calculated using Dragon software. The QSAR models were optimized using multiple linear regressions (MLR). Results: The most relevant molecular descriptors were obtained using the genetic algorithm (GA) and backward stepwise regression. The predictive powers of the GA-MLR models were studied using leaveone- out (LOO) cross-validation and an external test set. Conclusion: The obtained results of statistical parameters showed the BP-ANN model to have better performance compared to the GA-MLR model. To assess the predictive ability of QSAR models, many statistical terms, such as correlation coefficient (R2), leave-one-out cross-validation (LOOCV), root mean squared error (RMSE), and external and internal validation were used. The results of validation methods demonstrate the QSAR model to be robust and with high predictivity.
Read full abstract