A Comparative Assessment of Predictive Accuracies of Conventional and Machine Learning Scoring Functions for Protein-Ligand Binding Affinity Prediction

Hossam M Ashtawy,Nihar R Mahapatra

doi:10.1109/tcbb.2014.2351824

Abstract

Accurately predicting the binding affinities of large diverse sets of protein-ligand complexes efficiently is a key challenge in computational biomolecular science, with applications in drug discovery, chemical biology, and structural biology. Since a scoring function (SF) is used to score, rank, and identify potential drug leads, the fidelity with which it predicts the affinity of a ligand candidate for a protein's binding site has a significant bearing on the accuracy of virtual screening. Despite intense efforts in developing conventional SFs, which are either force-field based, knowledge-based, or empirical, their limited predictive accuracy has been a major roadblock toward cost-effective drug discovery. Therefore, in this work, we explore a range of novel SFs employing different machine-learning (ML) approaches in conjunction with a variety of physicochemical and geometrical features characterizing protein-ligand complexes. We assess the scoring accuracies of these new ML SFs as well as those of conventional SFs in the context of the 2007 and 2010 PDBbind benchmark datasets on both diverse and protein-family-specific test sets. We also investigate the influence of the size of the training dataset and the type and number of features used on scoring accuracy. We find that the best performing ML SF has a Pearson correlation coefficient of 0.806 between predicted and measured binding affinities compared to 0.644 achieved by a state-of-the-art conventional SF. We also find that ML SFs benefit more than their conventional counterparts from increases in the number of features and the size of training dataset. In addition, they perform better on novel proteins that they were never trained on before.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: Mar 1, 2015
Citations: 57	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

A Comparative Assessment of Predictive Accuracies of Conventional and Machine Learning Scoring Functions for Protein-Ligand Binding Affinity Prediction

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Similar Papers

Does Accurate Scoring of Ligands against Protein Targets Mean Accurate Ranking?
Nihar R Mahapatra ... Hossam M Ashtawy
-
Nihar R Mahapatra, et. al.Nihar R Mahapatra ... Hossam M Ashtawy
01 Jan 2013
01 Jan 2013

A Comparative Assessment of Ranking Accuracies of Conventional and Machine-Learning-Based Scoring Functions for Protein-Ligand Binding Affinity Prediction
Hossam M Ashtawy ... Nihar R Mahapatra
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 9
Hossam M Ashtawy, et. al.Hossam M Ashtawy ... Nihar R Mahapatra
01 Sep 2012
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 9

A Comparative Assessment of Conventional and Machine-Learning-Based Scoring Functions in Predicting Binding Affinities of Protein-Ligand Complexes
Nihar R Mahapatra ... Hossam M Ashtawy
-
Nihar R Mahapatra, et. al.Nihar R Mahapatra ... Hossam M Ashtawy
01 Nov 2011
01 Nov 2011

BgN-Score and BsN-Score: bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes.
Nihar R Mahapatra ... Hossam M Ashtawy
BMC bioinformatics | VOL. Suppl 16 4
Nihar R Mahapatra, et. al.Nihar R Mahapatra ... Hossam M Ashtawy
23 Feb 2015
BMC bioinformatics | VOL. Suppl 16 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparative Assessment of Predictive Accuracies of Conventional and Machine Learning Scoring Functions for Protein-Ligand Binding Affinity Prediction

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics