Abstract

The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relationship (QSPR) model of Caco-2 permeability is helpful to improve the efficiency of oral drug development. The aim of our study is to explore the predictive ability of the QSPR model, to study its permeation mechanism, and to develop a potential permeability prediction model, for Caco-2 cells. In our study, a relatively large data set was collected and the abnormal data were eliminated using the Monte Carlo regression and hybrid quantum particle swarm optimization (HQPSO) algorithm. Then, the remaining 1827 compounds were used to establish QSPR models. To generate multiple chemically diverse training and test sets, we used a combination of principal component analysis (PCA) and self-organizing mapping (SOM) neural networks to split the modeling data set characterized by PaDEL-descriptors. After preliminary selection of descriptors by the mean decrease impurity (MDI) method, the HQPSO algorithm was used to select the key descriptors. Six different methods, namely, multivariate linear regression (MLR), support vector machine regression (SVR), xgboost, radial basis function (RBF) neural networks, dual-SVR and dual-RBF were employed to develop QSPR models. The best dual-RBF model was obtained finally with R2 = 0.91, and Rcv52 = 0.77, for the training set, and RT2 = 0.77, for the test set. A series of validation methods were used to assess the robustness and predictive ability of the dual-RBF model under OECD principles. A new application domain (AD) definition method based on the descriptor importance-weighted and distance-based (IWD) method was proposed, and the outliers were analyzed carefully. Combined with the importance of the descriptors used in the dual-RBF model, we concluded that the “H E-state” and hydrogen bonds are important factors affecting the permeability of drugs passing through the Caco-2 cell. Compared with the reported studies, our method exhibits certain advantages in data size, transparency of modeling process and prediction accuracy to some extent, and is a promising tool for virtual screening in the early stage of drug development.

Highlights

  • In the process of drug development, lot of candidate drugs fail to become drugs mainly because of their safety issues and lack of efficacy.[1]

  • 0.79 No 58 0.77 No 17 0.81 Yes 3 — No 15 0.76 Yes Present a Genetic algorithm-neural network. b Arti cial neural networks. c Random forest; Nm is the number of compounds in the modeling set, and Nde is the number of descriptors used in each model

  • We have developed a new Caco-2 permeability prediction model based on dual-radial basis function (RBF) neural networks under the Organization for Economic Cooperation and Development (OECD) principle

Read more

Summary

Introduction

In the process of drug development, lot of candidate drugs fail to become drugs mainly because of their safety issues and lack of efficacy.[1] This is the main reason for high costs and timeconsumption in pharmaceutical engineering. In every stage of drug discovery and development, absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties of chemicals play vital role; so rapid evaluation of ADMET is the 42938 | RSC Adv., 2020, 10, 42938–42952. It is difficult to realize the HTS of drugs, not to mention the virtual screening in the early stage of drug discovery.[3,11] a rapid, accurate and economic model of Caco-2 permeability is the key to improve the efficiency of oral drug development

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call