Abstract

Accurate protein contact map prediction is essential for de novo protein structure prediction. Over the past few years, deep learning has brought a significant breakthrough in protein contact map prediction and optimized deep learning architectures are highly desired for performance improvement. As an emerging deep learning architecture, the generative adversarial network (GAN) has shown the powerful capability of learning intrinsic patterns, which inspires us to comprehensively exploit GAN for predicting accurate protein contact maps. In this study, we present GANcon, a novel GAN-based deep learning architecture for protein contact map prediction, which to the best of our knowledge is the first GAN-based approach in this field. Instead of using a single neural network, GANcon is composed of two competitive networks that are evolving through adversarial learning. The generator network employs a dedicated encoder-decoder architecture that can efficiently capture the underlying contact information from versatile protein features to generate contact maps, while the discriminator network learns the differences between generated contact maps and real ones and promotes the generator network to produce more accurate contact maps. Moreover, to deal with the imbalance problem and take into account the symmetry of contact maps, we also propose a novel symmetrical focal loss, which can further enhance the effectiveness of adversarial learning for better performance. The experimental results on several datasets demonstrate that GANcon outperforms many state-of-the-art methods, indicating the effectiveness of our method for predicting protein contact maps. GANcon is freely available at https://github.com/melissaya/GANcon.

Highlights

  • Proteins are crucially important macromolecules in an organism and play a fundamental role in almost all biological processes

  • These results indicate the effectiveness of adversarial learning in protein contact map prediction

  • Accurate prediction of the protein contact map is of great significance in de novo protein structure prediction

Read more

Summary

Introduction

Proteins are crucially important macromolecules in an organism and play a fundamental role in almost all biological processes. In order to carry out the essential cellular function, proteins fold into specific three-dimensional structures, which are driven and stabilized by the interactions between. The associate editor coordinating the review of this manuscript and approving it for publication was Hualong Yu. protein residues, i.e., protein residue contacts. All contacts of residue pairs can be encoded into a binary matrix named ‘contact map’, which has been regarded as a critical contributor for accurate de novo protein structure prediction [1], [2]. In recent Critical Assessment of protein Structure Prediction (CASP) experiments, many excellent de novo protein structure prediction methods have benefited much from the incorporation of predicted contact maps [3], [4].

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.