Searching for Target-Selective Compounds Using Different Combinations of Multiclass Support Vector Machine Ranking Methods, Kernel Functions, and Fingerprint Descriptors

Anne Mai Wassermann,Jürgen Bajorath,Hanna Geppert

doi:10.1021/ci800441c

Anne Mai Wassermann, Jürgen Bajorath + Show 1 more

https://doi.org/10.1021/ci800441c

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

The identification of small chemical compounds that are selective for a target protein over one or more closely related members of the same family is of high relevance for applications in chemical biology. Conventional 2D similarity searching using known selective molecules as templates has recently been found to preferentially detect selective over non-selective and inactive database compounds. To improve the initially observed search performance, we have attempted to use 2D fingerprints as descriptors for support vector machine (SVM)-based selectivity searching. Different from typically applied binary SVM compound classification, SVM analysis has been adapted here for multiclass predictions and compound ranking to distinguish between selective, active but non-selective, and inactive compounds. In systematic database search calculations, we tested combinations of four alternative SVM ranking schemes, four different kernel functions, and four fingerprints and were able to further improve selectivity search performance by effectively removing non-selective molecules from high ranking positions while retaining high recall of selective compounds.

Full Text