External Prediction Set Research Articles

Quantitative structure-activity relationship (QSAR) methodology explores the relationship between molecular structures and experimental endpoints, producing a model for predicting new data; the predictive performance of the model must be checked by external validation. The quality of the chemical structure information and of the experimental endpoints, as well as the statistical parameters used to verify external predictivity, strongly influences QSAR model reliability. Here, we emphasize the importance of these three aspects by analyzing our models of estrogen receptor binders from the Endocrine Disruptor Knowledge Base (EDKB) database. Endocrine disrupting chemicals, which mimic or antagonize endogenous hormones such as estrogens, are a hot topic in environmental and toxicological sciences. QSAR is of great value in predicting estrogenic activity and in exploring the interactions between the estrogen receptor and its ligands. We subjected our previously published model to additional external validation on new EDKB chemicals. Having found some errors in the 3D molecular conformations used, we developed a new model using the same data set with corrected structures, the same method (ordinary least squares regression, OLS) and DRAGON descriptors. The new model, based on some different descriptors, is more predictive on external prediction sets. Three different formulas for calculating the correlation coefficient for the external prediction set (Q²_EXT) were compared, and the results indicated that the new proposal of Consonni et al. gave more reasonable results, consistent with the conclusions from the regression line, Williams plot and root mean square error (RMSE) values.
Finally, the importance of reliable endpoint values is highlighted by comparing the classification assignments of EDKB with those of another estrogen receptor binder database (METI): we found that 16.1% of the assignments for the common compounds were opposite (20 of 124 common compounds). To determine the correct assignments for these inconsistent compounds, we predicted them, as a blind external set, with our regression models and compared the results with the two databases. The results indicated that most of the predictions were consistent with METI. Furthermore, we built a kNN classification model using the 104 consistent compounds to predict the inconsistent ones, and most of these predictions also agreed with the METI database.
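The abstract does not reproduce the three external-validation formulas, so the following is only a minimal sketch under stated assumptions. It uses the three Q²_EXT variants commonly cited in the QSAR validation literature, labelled here F1, F2 and F3, with F3 being the form usually attributed to Consonni et al. That these are the same three formulas the authors compared is an assumption, and the function name `external_validation_metrics` and the toy numbers are hypothetical.

```python
from math import sqrt


def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


def external_validation_metrics(y_train, y_ext, y_ext_pred):
    """Compute three Q2_EXT variants and the RMSE for an external set.

    y_train    : observed responses of the training set
    y_ext      : observed responses of the external prediction set
    y_ext_pred : model predictions for the external prediction set
    """
    n_tr, n_ext = len(y_train), len(y_ext)
    mean_tr, mean_ext = mean(y_train), mean(y_ext)

    # Predictive residual sum of squares over the external set.
    press = sum((y - p) ** 2 for y, p in zip(y_ext, y_ext_pred))

    # F1 (assumed form): external deviations taken from the TRAINING mean.
    ss_ext_about_tr = sum((y - mean_tr) ** 2 for y in y_ext)
    # F2 (assumed form): external deviations taken from the EXTERNAL mean.
    ss_ext_about_ext = sum((y - mean_ext) ** 2 for y in y_ext)
    # F3 (assumed form): training-set variance as reference, with both
    # sums scaled by their own set size.
    ss_tr = sum((y - mean_tr) ** 2 for y in y_train)

    return {
        "q2_f1": 1 - press / ss_ext_about_tr,
        "q2_f2": 1 - press / ss_ext_about_ext,
        "q2_f3": 1 - (press / n_ext) / (ss_tr / n_tr),
        "rmse_ext": sqrt(press / n_ext),
    }


# Toy data (hypothetical): the external set covers only the upper part
# of the training range, which makes the three variants disagree.
metrics = external_validation_metrics(
    y_train=[1.0, 2.0, 3.0, 4.0, 5.0],
    y_ext=[4.0, 5.0],
    y_ext_pred=[4.5, 4.5],
)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In this toy case the same predictions score very differently under F1 and F2 because their denominators depend on how the external responses are distributed, whereas F3 is referenced to the training-set variance only. This kind of sensitivity is why the choice of formula matters when judging external predictivity alongside RMSE.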

Antipsychotic medications have a diverse pharmacology with affinity for serotonergic, dopaminergic, adrenergic, histaminergic and cholinergic receptors. Their clinical use now also includes the treatment of mood disorders, thought to be mediated by serotonergic receptor activity. The aim of our study was to characterise the molecular properties of antipsychotic agents, and to develop a model that would indicate molecular specificity for the dopamine (D2) receptor and the serotonin (5-HT) transporter. Back-propagation artificial neural networks (ANNs) were trained on a dataset of 47 ligands categorically assigned antidepressant or antipsychotic utility. The structure of each compound was encoded with 63 calculated molecular descriptors. ANN parameters including hidden neurons and input descriptors were optimised based on sensitivity analyses, with optimum models containing between four and 14 descriptors. Predicted binding preferences were in excellent agreement with clinical antipsychotic or antidepressant utility. Validated models were further tested by use of an external prediction set of five drugs with unknown mechanism of action. The SAR models developed revealed the importance of simple molecular characteristics for differential binding to the D2 receptor and the 5-HT transporter. These included molecular size and shape, solubility parameters, hydrogen-donating potential, electrostatic parameters, stereochemistry and presence of nitrogen. The developed models and techniques employed are expected to be useful in the rational design of future therapeutic agents.

Articles published on External Prediction Set

A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest

A Feasibility Study on Using near Infrared Spectroscopy to Classify Straw-Coal Blends

Confirmation of brand identity in foods by near infrared transflectance spectroscopy using classification and class-modelling chemometric techniques — The example of a Belgian beer

Prediction of PKCθ inhibitory activity using the Random Forest Algorithm.

Quantitative structure-retention relationship (QSRR) models for predicting the GC retention times of essential oil components

Structural Contributions of Substrates to their Binding to P-Glycoprotein. A TOPSMODE Approach

Recognition of tablet content by chemometric processing of differential scanning calorimetry curves—An acetaminophen example

Carbon nuclear magnetic resonance spectroscopic fingerprinting of commercial gasoline: Pattern-recognition analyses for screening quality control purposes

Prediction of the Q-e parameters from radical structures

Multivariate image analysis-thin layer chromatography (MIA-TLC) for simultaneous determination of co-eluting components

The importance of molecular structures, endpoints’ values, and predictivity parameters in QSAR research: QSAR analysis of a series of estrogen receptor binders

Use of Self-Training Artificial Neural Networks in a QSRR Study of a Diverse Set of Organic Compounds

A segmented principal component analysis–regression approach to quantitative structure–activity relationship modeling

Structure-Activity Relationships for Serotonin Transporter and Dopamine Receptor Selectivity

Quantitative Structure−Property Relationship Estimation of Cation Binding Affinity of the Common Amino Acids

QSPR Study of the Distribution Coefficient Property for Hydantoin and 5‐Arylidene Derivatives. A Genetic Algorithm Application for the Variable Selection in the MLR and PLS Methods

Quantitative Structure–Property Relationship Studies for Predicting Flash Points of Organic Compounds using Support Vector Machines

Accurate Prediction of Aquatic Toxicity of Aromatic Compounds Based on Genetic Algorithm and Least Squares Support Vector Machines

QSPR Prediction of pKa for Benzoic Acids in Different Solvents

Artificial Neural Networks in ADMET Modeling: Prediction of Blood–Brain Barrier Permeation
