Abstract

Target fishing often relies on the use of reverse docking to identify potential target proteins of ligands from protein database. The limitation of reverse docking is the accuracy of current scoring funtions used to distinguish true target from non-target proteins. Many contemporary scoring functions are designed for the virtual screening of small molecules without special optimization for reverse docking, which would be easily influenced by the properties of protein pockets, resulting in scoring bias to the proteins with certain properties. This bias would cause lots of false positives in reverse docking, interferring the identification of true targets. In this paper, we have conducted a large-scale reverse docking (5000 molecules to 100 proteins) to study the scoring bias in reverse docking by DOCK, Glide, and AutoDock Vina. And we found that there were actually some frequency hits, namely interference proteins in all three docking procedures. After analyzing the differences of pocket properties between these interference proteins and the others, we speculated that the interference proteins have larger contact area (related to the size and shape of protein pockets) with ligands (for all three docking programs) or higher hydrophobicity (for Glide), which could be the causes of scoring bias. Then we applied the score normalization method to eliminate this scoring bias, which was effective to make docking score more balanced between different proteins in the reverse docking of benchmark dataset. Later, the Astex Diver Set was utilized to validate the effect of score normalization on actual cases of reverse docking, showing that the accuracy of target prediction significantly increased by 21.5% in the reverse docking by Glide after score normalization, though there was no obvious change in the reverse docking by DOCK and AutoDock Vina. Our results demonstrate the effectiveness of score normalization to eliminate the scoring bias and improve the accuracy of target prediction in reverse docking. Moreover, the properties of protein pockets causing scoring bias to certain proteins we found here can provide the theory basis to further optimize the scoring functions of docking programs for future research.

Highlights

  • Small molecule drugs are rarely selective enough to interact solely with their designated targets

  • The score range of 1d3g and 3kgc is -14.4 to 52.3, and -8.9 to -1.5 respectively. Their average score is –9.1 and -5.5. These results demonstrate that scoring functions may have different degrees of scoring bias to different proteins, some proteins resulting in a high overall docking score

  • We found that the interference proteins had larger contact area with ligands or higher hydrophobicity, which could be the possible causes of scoring bias

Read more

Summary

Introduction

Small molecule drugs are rarely selective enough to interact solely with their designated targets. The ability of small molecules to interact with multiple proteins provides the basis to develop multitarget drugs[7, 8]. Known as target fishing, helps to identify the potential targets of a query molecule. It may reveal targets of drugs with so far unknown mechanisms-of-action[9], contribute to rationally designing of less toxic or multitarget drugs[10,11,12,13], and reveal hidden opportunities in drug repurposing projects[14,15,16]

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call