Abstract
Comprehensive characterization of ligand-binding sites is invaluable to infer molecular functions of hypothetical proteins, trace evolutionary relationships between proteins, engineer enzymes to achieve a desired substrate specificity, and develop drugs with improved selectivity profiles. These research efforts pose significant challenges owing to the fact that similar pockets are commonly observed across different folds, leading to the high degree of promiscuity of ligand-protein interactions at the system-level. On that account, novel algorithms to accurately classify binding sites are needed. Deep learning is attracting a significant attention due to its successful applications in a wide range of disciplines. In this communication, we present DeepDrug3D, a new approach to characterize and classify binding pockets in proteins with deep learning. It employs a state-of-the-art convolutional neural network in which biomolecular structures are represented as voxels assigned interaction energy-based attributes. The current implementation of DeepDrug3D, trained to detect and classify nucleotide- and heme-binding sites, not only achieves a high accuracy of 95%, but also has the ability to generalize to unseen data as demonstrated for steroid-binding proteins and peptidase enzymes. Interestingly, the analysis of strongly discriminative regions of binding pockets reveals that this high classification accuracy arises from learning the patterns of specific molecular interactions, such as hydrogen bonds, aromatic and hydrophobic contacts. DeepDrug3D is available as an open-source program at https://github.com/pulimeng/DeepDrug3D with the accompanying TOUGH-C1 benchmarking dataset accessible from https://osf.io/enz69/.
Highlights
Proteins constitute a diverse group of biological macromolecules essential for the vast majority of processes in living organisms
The resulting wealth of structural data collected for a large number of organisms across all domains of life are available from the Protein Data Bank (PDB) [1]
Computational approaches to detect and analyze ligand-protein interactions notably contribute to numerous resources cataloging biological complexes, such as sc-PDB [2], BioLiP [3], PDBbind [4], Relibase [5], and the Protein-Ligand Interaction Clusters, or PLIC, database [6]
Summary
Small organic ligands bind to the locations of chemical specificity and affinity on their protein targets, called binding sites. Deep learning-based classification of ligand-binding sites and analysis, decision to publish, or preparation of the manuscript. DeepDrug3D is able to accurately classify binding sites by learning the patterns of specific molecular interactions between ligands and their protein targets, such as hydrogen bonds, aromatic and hydrophobic contacts. The current proof-of-concept implementation is limited to a few most abundant functional classes, the repertoire of pocket types handled by DeepDrug3D will significantly be expanded in the near future. This is a PLOS Computational Biology Software paper
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.