Comparability of Mixed IC50 Data – A Statistical Analysis

Tuomo Kalliokoski,Peter Gedeck,Anna Vulpetti,Christian Kramer

doi:10.1371/journal.pone.0061007

Abstract

The biochemical half maximal inhibitory concentration (IC50) is the most commonly used metric for on-target activity in lead optimization. It is used to guide lead optimization, build large-scale chemogenomics analysis, off-target activity and toxicity models based on public data. However, the use of public biochemical IC50 data is problematic, because they are assay specific and comparable only under certain conditions. For large scale analysis it is not feasible to check each data entry manually and it is very tempting to mix all available IC50 values from public database even if assay information is not reported. As previously reported for Ki database analysis, we first analyzed the types of errors, the redundancy and the variability that can be found in ChEMBL IC50 database. For assessing the variability of IC50 data independently measured in two different labs at least ten IC50 data for identical protein-ligand systems against the same target were searched in ChEMBL. As a not sufficient number of cases of this type are available, the variability of IC50 data was assessed by comparing all pairs of independent IC50 measurements on identical protein-ligand systems. The standard deviation of IC50 data is only 25% larger than the standard deviation of Ki data, suggesting that mixing IC50 data from different assays, even not knowing assay conditions details, only adds a moderate amount of noise to the overall data. The standard deviation of public ChEMBL IC50 data, as expected, resulted greater than the standard deviation of in-house intra-laboratory/inter-day IC50 data. Augmenting mixed public IC50 data by public Ki data does not deteriorate the quality of the mixed IC50 data, if the Ki is corrected by an offset. For a broad dataset such as ChEMBL database a Ki- IC50 conversion factor of 2 was found to be the most reasonable.

Highlights

IntroductionPublic collections of IC50 data (the half maximal inhibitory concentrations of ligands on their protein targets) represent a wealth of knowledge on bioactivity with growing importance
Public collections of IC50 data represent a wealth of knowledge on bioactivity with growing importance
In order to assess the comparability of IC50 values, we first extracted all series of compounds that have been measured against the same protein target in two independent assays from whole ChEMBL

Summary

Introduction

Public collections of IC50 data (the half maximal inhibitory concentrations of ligands on their protein targets) represent a wealth of knowledge on bioactivity with growing importance. [2] Proper usage of IC50 data facilitates the development of useful methods for drug discovery. Examples of such applications are the global mapping of pharmacological space by Paolini and coworkers, [3] the Similarity Ensemble Approach (SEA), [4] the Bayesian models for adverse drug reactions by Bender and coworkers, [5] the models used for polypharmacological optimization by Hopkins et al, [6] and the kinome-wide activity modeling studies by Schuerer and Muskal. [7] These methods can be used to predict off-target effects based on heterogeneous public activity data and chemical similarity analysis. For the simplest typical case of competitive monosubstrate enzyme inhibition, Ki can be calculated from the IC50 according to the Cheng-Prusoff equation: Ki

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: Apr 16, 2013
Citations: 251	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Comparability of Mixed IC50 Data – A Statistical Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Burden Testing of Rare Variants Identified through Exome Sequencing via Publicly Available Control Data
Michael H Guo ... Margaret F Lippincott
The American Journal of Human Genetics | VOL. 103
Michael H Guo, et. al.Michael H Guo ... Margaret F Lippincott
27 Sep 2018
The American Journal of Human Genetics | VOL. 103

Mandatory submission of microarray data to public repositories: how is it working?
Beverly Ventura
Physiological Genomics | VOL. 20
Beverly VenturaBeverly Ventura
20 Jan 2005
Physiological Genomics | VOL. 20

MBLabDB: a social database for molecular biodiversity data
Flavio Licciulli ... Saverio Vicario
EMBnet.journal | VOL. 18
Flavio Licciulli, et. al.Flavio Licciulli ... Saverio Vicario
09 Nov 2012
EMBnet.journal | VOL. 18

Public data evolution games on complex networks and data quality control
Wenqi Liu
SCIENTIA SINICA Informationis | VOL. 46
Wenqi LiuWenqi Liu
01 Nov 2016
SCIENTIA SINICA Informationis | VOL. 46

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparability of Mixed IC50 Data – A Statistical Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE