Predictiveness curves in virtual screening.

Charly Empereur-Mot,Aurélien Latouche,Vivian Viallon,Matthieu Montes,Jean-François Zagury,Hélène Guillemain

doi:10.1186/s13321-015-0100-8

Abstract

BackgroundIn the present work, we aim to transfer to the field of virtual screening the predictiveness curve, a metric that has been advocated in clinical epidemiology. The literature describes the use of predictiveness curves to evaluate the performances of biological markers to formulate diagnoses, prognoses and assess disease risks, assess the fit of risk models, and estimate the clinical utility of a model when applied to a population. Similarly, we use logistic regression models to calculate activity probabilities related to the scores that the compounds obtained in virtual screening experiments. The predictiveness curve can provide an intuitive and graphical tool to compare the predictive power of virtual screening methods.ResultsSimilarly to ROC curves, predictiveness curves are functions of the distribution of the scores and provide a common scale for the evaluation of virtual screening methods. Contrarily to ROC curves, the dispersion of the scores is well described by predictiveness curves. This property allows the quantification of the predictive performance of virtual screening methods on a fraction of a given molecular dataset and makes the predictiveness curve an efficient tool to address the early recognition problem. To this last end, we introduce the use of the total gain and partial total gain to quantify recognition and early recognition of active compounds attributed to the variations of the scores obtained with virtual screening methods. Additionally to its usefulness in the evaluation of virtual screening methods, predictiveness curves can be used to define optimal score thresholds for the selection of compounds to be tested experimentally in a drug discovery program. We illustrate the use of predictiveness curves as a complement to ROC on the results of a virtual screening of the Directory of Useful Decoys datasets using three different methods (Surflex-dock, ICM, Autodock Vina).ConclusionThe predictiveness curves cover different aspects of the predictive power of the scores, allowing a detailed evaluation of the performance of virtual screening methods. We believe predictiveness curves efficiently complete the set of tools available for the analysis of virtual screening results.Electronic supplementary materialThe online version of this article (doi:10.1186/s13321-015-0100-8) contains supplementary material, which is available to authorized users.

Highlights

In the present work, we aim to transfer to the field of virtual screening the predictiveness curve, a metric that has been advocated in clinical epidemiology
Assessment of the predictive power of a scoring function We first illustrated the use of the predictiveness curve as a complement to the receiver operating characteristics (ROC) curve with the results obtained from Surflex-dock, ICM, and Autodock Vina on target retinoic X receptor (RXR) of the directory of useful decoys (DUD) dataset (Fig. 2)
The ROC curve profiles suggested that acceptable early recognition has been achieved by the three methods (Surflex-dock pAUC2 %: 0.167, ICM pAUC2 %: 0.342, Autodock Vina pAUC2 %: 0.330), which was confirmed in terms of enrichment (Surflex-dock EF2 %: 16.84, ICM EF2 %: 24.06, Autodock Vina EF2 %: 26.47)

Summary

Introduction

We aim to transfer to the field of virtual screening the predictiveness curve, a metric that has been advocated in clinical epidemiology. The aim of virtual screening methods is to enrich a subset of molecules in potentially active compounds In this context, different metrics have emerged to evaluate the performance of virtual screening methods: enrichment factors (EFs), receiver operating characteristics (ROC) curves [2], the area under the ROC curve. Virtual screening methods are used to prioritize a subset of the screened compound collection for experimental testing, whereas ROC curves and ROC AUC summarize the ability of a method to rank a database over its entirety [4, 6]. The true positive fraction (TPF) and false positive fraction (FPF) of the ROC plot can reflect a very different number of compounds on an identical scale, which can be misleading for analyzing the early recognition of active compounds

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Cheminformatics	Publication Date: Nov 4, 2015
Citations: 78	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Predictiveness curves in virtual screening.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Cheminformatics

Lead the way for us

Similar Papers

Virtual Screening, Molecular Docking and QSAR Studies in Drug Discovery and Development Programme
Mithun Rudrapal ... Dipak Chetia
Journal of Drug Delivery and Therapeutics | VOL. 10
Mithun Rudrapal, et. al.Mithun Rudrapal ... Dipak Chetia
15 Jul 2020
Journal of Drug Delivery and Therapeutics | VOL. 10

Optimization Methods for Virtual Screening on Novel Computational Architectures
Horacio Perez-Sanchez ... Wolfgang Wenzel
Current Computer Aided-Drug Design | VOL. 7
Horacio Perez-Sanchez, et. al.Horacio Perez-Sanchez ... Wolfgang Wenzel
01 Mar 2011
Current Computer Aided-Drug Design | VOL. 7

PI3 kinase and inhibitors: targeting isoform selectivity

-

22 Mar 2017
22 Mar 2017

Ligand protonation states and stereoisomers in virtual screening
T Ten Brink ... Te Exner
Chemistry Central Journal | VOL. 3
T Ten Brink, et. al.T Ten Brink ... Te Exner
01 Jun 2009
Chemistry Central Journal | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predictiveness curves in virtual screening.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Cheminformatics