Abstract

MotivationProtein backbone angle prediction has achieved significant accuracy improvement with the development of deep learning methods. Usually the same deep learning model is used in making prediction for all residues regardless of the categories of secondary structures they belong to. In this paper, we propose to train separate deep learning models for each category of secondary structures. Machine learning methods strive to achieve generality over the training examples and consequently loose accuracy. In this work, we explicitly exploit classification knowledge to restrict generalisation within the specific class of training examples. This is to compensate the loss of generalisation by exploiting specialisation knowledge in an informed way.ResultsThe new method named SAP4SS obtains mean absolute error (MAE) values of 15.59, 18.87, 6.03, and 21.71 respectively for four types of backbone angles phi, psi, theta, and tau. Consequently, SAP4SS significantly outperforms existing state-of-the-art methods SAP, OPUS-TASS, and SPOT-1D: the differences in MAE for all four types of angles are from 1.5 to 4.1% compared to the best known results.AvailabilitySAP4SS along with its data is available from https://gitlab.com/mahnewton/sap4ss.

Highlights

  • Proteins comprise amino acid (AA) sequences and fold into three dimensional (3D) structures

  • Availability: SAP4SS along with its data is available from https://gitlab.com/mahnewton/sap4ss

  • AAs can be of 20 types based on the uniqueness of the side chains that start from their Cα atoms

Read more

Summary

Introduction

Proteins comprise amino acid (AA) sequences and fold into three dimensional (3D) structures. The native structure of a protein has the minimum free energy and it determines the function of the protein. The protein structure prediction (PSP) problem is to determine the native structure of a protein from its AA sequence. The challenge comes from the astronomically large conformational search space and the unknown energy function involved in the folding process [2]. Proteins have backbones or main chains comprising peptide bonds that connect C and N atoms of successive AAs. AAs all have three common atoms N, Cα , and C in sequence. Protein backbone structures can be represented by a series of dihedral angles φi , ψi , and ωi. These dihedral angles are defined respectively by every four consecutive atoms from

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call