Combination Of Descriptors Research Articles

Quantitative Structure Activity Relationship (QSAR) models can inform on the correlation between activities and structure-based molecular descriptors. This information is important for the understanding of the factors that govern molecular properties and for designing new compounds with favorable properties. Due to the large number of calculate-able descriptors and consequently, the much larger number of descriptors combinations, the derivation of QSAR models could be treated as an optimization problem. For continuous responses, metrics which are typically being optimized in this process are related to model performances on the training set, for example, and . Similar metrics, calculated on an external set of data (e.g., ), are used to evaluate the performances of the final models. A common theme of these metrics is that they are context -” ignorant”. In this work we propose that QSAR models should be evaluated based on their intended usage. More specifically, we argue that QSAR models developed for Virtual Screening (VS) should be derived and evaluated using a virtual screening-aware metric, e.g., an enrichment-based metric. To demonstrate this point, we have developed 21 Multiple Linear Regression (MLR) models for seven targets (three models per target), evaluated them first on validation sets and subsequently tested their performances on two additional test sets constructed to mimic small-scale virtual screening campaigns. As expected, we found no correlation between model performances evaluated by “classical” metrics, e.g., and and the number of active compounds picked by the models from within a pool of random compounds. In particular, in some cases models with favorable and/or values were unable to pick a single active compound from within the pool whereas in other cases, models with poor and/or values performed well in the context of virtual screening. We also found no significant correlation between the number of active compounds correctly identified by the models in the training, validation and test sets. Next, we have developed a new algorithm for the derivation of MLR models by optimizing an enrichment-based metric and tested its performances on the same datasets. We found that the best models derived in this manner showed, in most cases, much more consistent results across the training, validation and test sets and outperformed the corresponding MLR models in most virtual screening tests. Finally, we demonstrated that when tested as binary classifiers, models derived for the same targets by the new algorithm outperformed Random Forest (RF) and Support Vector Machine (SVM)-based models across training/validation/test sets, in most cases. We attribute the better performances of the Enrichment Optimizer Algorithm (EOA) models in VS to better handling of inactive random compounds. Optimizing an enrichment-based metric is therefore a promising strategy for the derivation of QSAR models for classification and virtual screening.

Read full abstract

Summary Objectives : To summarize key contributions to current research in the field of Clinical Research Informatics (CRI) and to select best papers published in 2019. Method : A bibliographic search using a combination of MeSH descriptors and free-text terms on CRI was performed using PubMed, followed by a double-blind review in order to select a list of candidate best papers to be then peer-reviewed by external reviewers. After peer-review ranking, a consensus meeting between the two section editors and the editorial team was organized to finally conclude on the selected three best papers. Results : Among the 517 papers, published in 2019, returned by the search, that were in the scope of the various areas of CRI, the full review process selected three best papers. The first best paper describes the use of a homomorphic encryption technique to enable federated analysis of real-world data while complying more easily with data protection requirements. The authors of the second best paper demonstrate the evidence value of federated data networks reporting a large real world data study related to the first line treatment for hypertension. The third best paper reports the migration of the US Food and Drug Administration (FDA) adverse event reporting system database to the OMOP common data model. This work opens the combined analysis of both spontaneous reporting system and electronic health record (EHR) data for pharmacovigilance. Conclusions : The most significant research efforts in the CRI field are currently focusing on real world evidence generation and especially the reuse of EHR data. With the progress achieved this year in the areas of phenotyping, data integration, semantic interoperability, and data quality assessment, real world data is becoming more accessible and reusable. High quality data sets are key assets not only for large scale observational studies or for changing the way clinical trials are conducted but also for developing or evaluating artificial intelligence algorithms guiding clinical decision for more personalized care. And lastly, security and confidentiality, ethical and regulatory issues, and more generally speaking data governance are still active research areas this year.

Read full abstract

Combination Of Descriptors Research Articles

Related Topics

Articles published on Combination Of Descriptors

Evaluation of QSAR Equations for Virtual Screening.

Percepção de usuárias no climatério sobre as práticas integrativas

A Sequence-segment Neighbor Encoding Schema for Protein Hotspot Residue Prediction

Molecular Docking and QSAR Studies of Coumarin Derivatives as NMT Inhibitors: Simple Structural Features as Potential Modulators of Antifungal Activity

Risk prediction model using eye movements during simulated driving with logistic regressions and neural networks

Structure–Property Correlation for Calculating the Critical Pressures of Liquid–Vapor Phase Transitions from the Topological Characteristics of Alkene Molecules

Predictive Modeling of Angiotensin I-Converting Enzyme Inhibitory Peptides Using Various Machine Learning Approaches.

Systemic Evaluation of the Effects of Regional Self-Supply Targets on the German Electricity System Using Consistent Scenarios and System Optimization

Human motion recognition based on limit learning machine

Conformational analysis and QSAR modeling of 14-membered macrolide analogues against mycobacterium tuberculosis

First report on chemometric modeling of hydrolysis half-lives of organic chemicals.

Predicting Chemical-Induced Liver Toxicity Using High-Content Imaging Phenotypes and Chemical Descriptors: A Random Forest Approach.

Máscaras de proteção e os reveses da busca por itens de proteção individual em tempos de COVID-19 - uma revisão integrativa

Enhancement of student performance prediction using modified K-nearest neighbor

Clinical Research Informatics.

MAIN COMPLICATIONS IN INTESTINAL OSTOMIES: AN INTEGRATIVE REVIEW

Studies on the IC50 of Metabolically Stable 1-(3,3-diphenylpropyl)- piperidinyl Amides and Ureas as Human CCR5 Receptor Antagonists Based on QSAR

Treatment Adherence of Patients with Chronic Kidney Disease: an Integrative Literature Review

ECG heartbeat classification by means of variable rational projection

Image Retrieval Based on the Combination of Region and Orientation Correlation Descriptors

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Combination Of Descriptors Research Articles

Related Topics

Articles published on Combination Of Descriptors

Evaluation of QSAR Equations for Virtual Screening.

Percepção de usuárias no climatério sobre as práticas integrativas

A Sequence-segment Neighbor Encoding Schema for Protein Hotspot Residue Prediction

Molecular Docking and QSAR Studies of Coumarin Derivatives as NMT Inhibitors: Simple Structural Features as Potential Modulators of Antifungal Activity

Risk prediction model using eye movements during simulated driving with logistic regressions and neural networks

Structure–Property Correlation for Calculating the Critical Pressures of Liquid–Vapor Phase Transitions from the Topological Characteristics of Alkene Molecules

Predictive Modeling of Angiotensin I-Converting Enzyme Inhibitory Peptides Using Various Machine Learning Approaches.

Systemic Evaluation of the Effects of Regional Self-Supply Targets on the German Electricity System Using Consistent Scenarios and System Optimization

Human motion recognition based on limit learning machine

Conformational analysis and QSAR modeling of 14-membered macrolide analogues against mycobacterium tuberculosis

First report on chemometric modeling of hydrolysis half-lives of organic chemicals.

Predicting Chemical-Induced Liver Toxicity Using High-Content Imaging Phenotypes and Chemical Descriptors: A Random Forest Approach.

Máscaras de proteção e os reveses da busca por itens de proteção individual em tempos de COVID-19 - uma revisão integrativa

Enhancement of student performance prediction using modified K-nearest neighbor

Clinical Research Informatics.

MAIN COMPLICATIONS IN INTESTINAL OSTOMIES: AN INTEGRATIVE REVIEW

Studies on the IC50 of Metabolically Stable 1-(3,3-diphenylpropyl)- piperidinyl Amides and Ureas as Human CCR5 Receptor Antagonists Based on QSAR

Treatment Adherence of Patients with Chronic Kidney Disease: an Integrative Literature Review

ECG heartbeat classification by means of variable rational projection

Image Retrieval Based on the Combination of Region and Orientation Correlation Descriptors