Ranking Molecules with Vanishing Kernels and a Single Parameter: Active Applicability Domain Included.

Francois Berenger,Yoshihiro Yamanishi

doi:10.1021/acs.jcim.9b01075

Abstract

In ligand-based virtual screening, high-throughput screening (HTS) data sets can be exploited to train classification models. Such models can be used to prioritize yet untested molecules, from the most likely active (against a protein target of interest) to the least likely active. In this study, a single-parameter ranking method with an Applicability Domain (AD) is proposed. In effect, Kernel Density Estimates (KDE) are revisited to improve their computational efficiency and incorporate an AD. Two modifications are proposed: (i) using vanishing kernels (i.e., kernel functions with a finite support) and (ii) using the Tanimoto distance between molecular fingerprints as a radial basis function. This construction is termed "Vanishing Ranking Kernels" (VRK). Using VRK on 21 HTS assays, it is shown that VRK can compete in performance with a graph convolutional deep neural network. VRK are conceptually simple and fast to train. During training, they require optimizing a single parameter. A trained VRK model usually defines an active AD. Exploiting this AD can significantly increase the screening frequency of a VRK model. Software: https://github.com/UnixJunkie/rankers. Data sets: https://zenodo.org/record/1320776 and https://zenodo.org/record/3540423.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ranking Molecules with Vanishing Kernels and a Single Parameter: Active Applicability Domain Included.

Abstract

Talk to us

Similar Papers

More From: Journal of chemical information and modeling

Lead the way for us

Journal: Journal of chemical information and modeling	Publication Date: Apr 13, 2020
Citations: 3

Similar Papers

Analysis of a High-Throughput Screening Data Set Using Potency-Scaled Molecular Similarity Algorithms
Ingo Vogt ... Jürgen Bajorath
Journal of Chemical Information and Modeling | VOL. 47
Ingo Vogt, et. al.Ingo Vogt ... Jürgen Bajorath
15 Feb 2007
Journal of Chemical Information and Modeling | VOL. 47

A Distance-Based Boolean Applicability Domain for Classification of High Throughput Screening Data.
Francois Berenger ... Yoshihiro Yamanishi
Journal of Chemical Information and Modeling | VOL. 59
Francois Berenger, et. al.Francois Berenger ... Yoshihiro Yamanishi
19 Dec 2018
Journal of Chemical Information and Modeling | VOL. 59

Hybrid text classification model based on graph convolution network and neural network
Zhaohe Dong ... Zhengli Zhai
-
Zhaohe Dong, et. al.Zhaohe Dong ... Zhengli Zhai
01 Jun 2023
01 Jun 2023

GPU Accelerated Support Vector Machines for Mining High-Throughput Screening Data
Quan Liao ... Jibo Wang
Journal of Chemical Information and Modeling | VOL. 49
Quan Liao, et. al.Quan Liao ... Jibo Wang
04 Dec 2009
Journal of Chemical Information and Modeling | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ranking Molecules with Vanishing Kernels and a Single Parameter: Active Applicability Domain Included.

Abstract

Talk to us

Similar Papers

More From: Journal of chemical information and modeling