True Accuracy of Fast Scoring Functions to Predict High-Throughput Screening Data from Docking Poses: The Simpler the Better.

Viet-Khoa Tran-Nguyen,Didier Rognan,Guillaume Bret

doi:10.1021/acs.jcim.1c00292

Abstract

Hundreds of fast scoring functions have been developed over the last 20 years to predict binding free energies from three-dimensional structures of protein-ligand complexes. Despite numerous statistical promises, we believe that none of them has been properly validated for daily prospective high-throughput virtual screening studies, mostly because in silico screening challenges usually employ artificially built and biased datasets. We here carry out a fully unbiased evaluation of four scoring functions (Pafnucy, ΔvinaRF20, IFP, and GRIM) on an in-house developed data collection of experimental high-confidence screening data (LIT-PCBA) covering about 3 million data points on 15 diverse pharmaceutical targets. All four scoring functions were applied to rescore the docking poses of LIT-PCBA compounds in conditions mimicking exactly standard drug discovery scenarios and were compared in terms of propensity to enrich true binders in the top 1%-ranked hit lists. Interestingly, rescoring based on simple interaction fingerprints or interaction graphs outperforms state-of-the-art machine learning and deep learning scoring functions in most of the cases. The current study notably highlights the strong tendency of deep learning methods to predict affinity values within a very narrow range centered on the mean value of samples used for training. Moreover, it suggests that knowledge of pre-existing binding modes is the key to detecting the most potent binders.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

True Accuracy of Fast Scoring Functions to Predict High-Throughput Screening Data from Docking Poses: The Simpler the Better.

Abstract

Talk to us

Similar Papers

More From: Journal of Chemical Information and Modeling

Lead the way for us

Journal: Journal of Chemical Information and Modeling	Publication Date: Jun 10, 2021
Citations: 37

Similar Papers

Comparative assessment of machine-learning scoring functions on PDBbind 2013
Mohamed A Khamis ... Walid Gomaa
Engineering Applications of Artificial Intelligence | VOL. 45
Mohamed A Khamis, et. al.Mohamed A Khamis ... Walid Gomaa
16 Jul 2015
Comparative assessment of machine-learning scoring functions on PDBbind 2013
Mohamed A Khamis ... Walid Gomaa

Machine learning in computational docking.
Mohamed A Khamis ... Walaa F Ahmed
Artificial Intelligence in Medicine | VOL. 63
Mohamed A Khamis, et. al.Mohamed A Khamis ... Walaa F Ahmed
16 Feb 2015
Artificial Intelligence in Medicine | VOL. 63

N-Ethylmaleimide sensitive factor in the cortex of subjects with schizophrenia and bipolar I disorder
Laura Gray ... Brian Dean
Neuroscience Letters | VOL. 391
Laura Gray, et. al.Laura Gray ... Brian Dean
13 Sep 2005
Neuroscience Letters | VOL. 391

Author response: Comprehensive and unbiased multiparameter high-throughput screening by compaRe finds effective and subtle drug responses in AML models
Ella Karjalainen ... Kirsten Grønbæk
-
Ella Karjalainen, et. al.Ella Karjalainen ... Kirsten Grønbæk
25 Jan 2022
25 Jan 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

True Accuracy of Fast Scoring Functions to Predict High-Throughput Screening Data from Docking Poses: The Simpler the Better.

Abstract

Talk to us

Similar Papers

More From: Journal of Chemical Information and Modeling