Physicochemical Descriptors Research Articles

The in vitro-in vivo extrapolation (IVIVE) approach for predicting total plasma clearance (CLtot) has been widely used to rank-order compounds early in discovery. More recently, a computational machine learning approach utilizing physicochemical descriptors and fingerprints calculated from chemical structure information has emerged, enabling virtual predictions even earlier in discovery. Previously, this approach focused more on in vitro intrinsic clearance (CLint) prediction. Herein, we directly compare these two approaches for predicting CLtot in rats. A structurally diverse set of 1114 compounds with known in vivo CLtot, in vitro CLint, and plasma protein binding was used as the basis for this evaluation. The machine learning models were assessed using time-split and cluster-split training and test sets, and five-fold cross validation. In five-fold cross validation, the random forest regression (RF) and radial basis function (RBF) models showed the best prediction performance among the eight machine learning models attempted. The CLtot values predicted by the RF and RBF models were within two-fold of the observed values for 67.7% and 71.9% of cluster-split test set compounds, respectively, while the predictivity was worse in the time-split dataset. The predictivity of both models tended to improve when the in vitro parameters, unbound fraction in plasma (fu,p) and CLint, were incorporated. CLtot prediction utilizing in vitro CLint and the well-stirred model, correcting for the fraction unbound in blood, was substantially worse than the machine learning approaches for the same cluster-split test set. The underestimation of CLtot by IVIVE was not fully explained by accounting for the calculated microsomal unbound fraction (cfu,mic) or the extended clearance classification system (ECCS), or by omitting high clearance compounds in excess of hepatic blood flow. The analysis suggests that in silico machine learning models may have the power to reduce reliance on or replace in vitro and in vivo studies for chemical structure optimization in early drug discovery.
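
To make the IVIVE side of this comparison concrete, the following is a minimal Python sketch of the well-stirred model and of the "within two-fold" accuracy metric mentioned in the abstract. It is not the authors' code: the function names, the default hepatic blood flow (an illustrative literature value for rat) and the example numbers are all assumptions, and the scaling of raw microsomal CLint to whole-body units is presumed to have been done beforehand.

```python
def well_stirred_clearance(cl_int, fu_b, q_h=55.2, fu_mic=1.0):
    """Hepatic clearance (mL/min/kg) from the well-stirred model.

    cl_int : scaled in vitro intrinsic clearance (mL/min/kg)
    fu_b   : fraction unbound in blood
    q_h    : hepatic blood flow (mL/min/kg); the default is an
             illustrative rat value, not taken from the abstract
    fu_mic : unbound fraction in the microsomal incubation
             (1.0 means no correction is applied)
    """
    cl_int_u = cl_int / fu_mic  # unbound intrinsic clearance
    return q_h * fu_b * cl_int_u / (q_h + fu_b * cl_int_u)


def fraction_within_fold(predicted, observed, fold=2.0):
    """Share of compounds whose predicted/observed ratio lies
    between 1/fold and fold (inclusive)."""
    hits = sum(1 for p, o in zip(predicted, observed)
               if 1.0 / fold <= p / o <= fold)
    return hits / len(observed)


# Hypothetical example values, for illustration only.
cl_h = well_stirred_clearance(cl_int=100.0, fu_b=0.1, q_h=50.0)
share = fraction_within_fold([1.0, 3.0, 10.0], [1.5, 1.0, 10.0])
```

Because the model output saturates at the hepatic blood flow, compounds whose observed clearance exceeds that flow can only be underpredicted, which is one of the factors the abstract reports examining.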

In micellar liquid chromatography (MLC), the addition of a surfactant to the mobile phase in excess is accompanied by an alteration of its solubilising capacity and a change in the stationary phase's properties. As a consequence, predicting the analytes' retention in MLC mode becomes a challenging task. Mixed Quantitative Structure-Retention Relationships (QSRR) modelling represents a powerful tool for estimating the analytes' retention. This study compares 48 successfully developed mixed QSRR models with respect to their ability to predict the retention of aripiprazole and its five impurities from molecular structures and factors that describe the Brij-acetonitrile system. The models were developed by automatically combining six attribute (feature) selection methods with eight predictive algorithms and optimizing the hyper-parameters. The feature selection methods included Principal Component Analysis (PCA), Non-negative Matrix Factorization (NMF), ReliefF, Multiple Linear Regression (MLR), Mutual Information and F-Regression. The predictive algorithms investigated comprised Linear Regression (LR), Ridge Regression, Lasso Regression, Artificial Neural Networks (ANN), Support Vector Regression (SVR), Random Forest (RF), Gradient Boosted Trees (GBT) and k-Nearest Neighbours (k-NN). A sufficient amount of data for building the models (78 cases in total) was provided by conducting 13 experiments for each of the 6 analytes and collecting the target responses afterwards. Different experimental settings were established by varying the concentration of Brij L23, the pH of the aqueous phase and the acetonitrile content in the mobile phase according to the Box-Behnken design. In addition to the chromatographic parameters, the pool of independent variables was expanded by 27 molecular descriptors from all major groups (physicochemical, quantum chemical, topological and spatial structural descriptors). The best model was chosen by taking into consideration the Root Mean Square Error (RMSE) and the cross-validation (CV) correlation coefficient (Q²) values. Interestingly, the comparative analysis indicated that a change in the set of input variables had a minor impact on the performance of the final models. On the other hand, the regression algorithms differed greatly in their ability to learn patterns in the data. Testing many regression algorithms is therefore necessary in order to find the most suitable technique for model building. In this specific case, GBT-based models demonstrated the best ability to predict the retention factor in MLC mode. Steric factors and dipole-dipole interactions proved relevant to the observed retention behaviour. Although small in scale, this study is a promising starting point for comprehensive MLC retention prediction.
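
The experiment count in this abstract can be checked with a short Python sketch of a three-factor Box-Behnken design in coded levels (-1, 0, +1). It rests on an assumption: the abstract does not state the number of centre points, and a single centre run is inferred here because it gives the reported 13 experiments. The factor names are hypothetical labels for the three variables described.

```python
from itertools import combinations, product


def box_behnken(n_factors, n_center=1):
    """Coded Box-Behnken design built pairwise: each pair of factors
    takes all four (+/-1, +/-1) combinations while the remaining
    factors stay at 0, followed by the centre runs."""
    runs = []
    for i, j in combinations(range(n_factors), 2):
        for a, b in product((-1, 1), repeat=2):
            run = [0] * n_factors
            run[i], run[j] = a, b
            runs.append(tuple(run))
    runs.extend([(0,) * n_factors] * n_center)
    return runs


# Hypothetical names for the three factors varied in the study.
factors = ("brij_l23_conc", "aqueous_ph", "acetonitrile_pct")
design = box_behnken(len(factors))  # 12 edge-midpoint runs + 1 centre run
n_analytes = 6
n_cases = len(design) * n_analytes  # one retention value per run and analyte

print(len(design), n_cases)
```

Each coded run would then be mapped onto the real low, middle and high settings of the corresponding factor, and every run contributes one row per analyte to the modelling data set.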

Articles published on Physicochemical Descriptors

Structure-dependent effects of sweet and sweet taste affecting compounds on their sensorial properties

Quantitative Structure-Activity Relationship Studies Of Series Of Chalcones Derivatives as Inhibitors Of Tumor Necrosis Factor-Alpha

Cardiovascular Effects of Polychlorinated Biphenyls and Their Major Metabolites.

Cytochrome-P450-Mediated Drug-Drug Interactions of Substrate Drugs: Assessing Clinical Risk Based on Molecular Properties and an Extended Clearance Classification System.

Elaboration of Novel TTK1 Inhibitory Leads via QSAR-Guided Selection of Crystallographic Pharmacophores Followed By In Vitro Assay.

QSAR-derived affinity fingerprints (part 2): modeling performance for potency prediction

Comparison of statistical methods for predicting penetration capacity of drugs into human breast milk using physicochemical, pharmacokinetic and chromatographic descriptors

Direct Comparison of Total Clearance Prediction: Computational Machine Learning Model versus Bottom-Up Approach Using In Vitro Assay

Comprehensive quantum mechanical studies on three bioactive anastrozole based triazole analogues and their SERS active graphene complex

Performance comparison of nonlinear and linear regression algorithms coupled with different attribute selection methods for quantitative structure - retention relationships modelling in micellar liquid chromatography

Solvent Selection Scheme Using Machine Learning Based on Physicochemical Description of Solvent Molecules: Application to Cyclic Organometallic Reaction

A theoretical evaluation on free radical scavenging activity of 3-styrylchromone derivatives: the DFT study.

QSAR modeling, molecular docking and ADMET/pharmacokinetic studies: a chemometrics approach to search for novel inhibitors of norepinephrine transporter as potent antipsychotic drugs

Comprehensive investigation of selectivity landscape of glycogen synthase kinase-3 inhibitors

Development of a Gaussian Process - feature selection model to characterise (poly)dimethylsiloxane (Silastic® ) membrane permeation.

Detailed quantum mechanical studies on bioactive benzodiazepine derivatives and their adsorption over graphene sheets.

Updating the portfolio of physicochemical descriptors related to permeability in the beyond the rule of 5 chemical space

Inclusion of molecular descriptors in predictive models improves pesticide soil-air partitioning estimates

Polymorphism of monotropic forms: relationships between thermochemical and structural characteristics.

Prediction of peptide binding to MHC using machine learning with sequence and structure-based feature sets
