Abstract

β-turns are the most common type of non-repetitive structures, and constitute on average 25% of the amino acids in proteins. The formation of β-turns plays an important role in protein folding, protein stability and molecular recognition processes. In this work we present the neural network method NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which is the highest reported performance on a two-class prediction of β-turn and not-β-turn. Furthermore NetTurnP shows improved performance on some of the specific β-turn types. In the present work, neural network methods have been trained to predict β-turn or not and individual β-turn types from the primary amino acid sequence. The individual β-turn types I, I', II, II', VIII, VIa1, VIa2, VIba and IV have been predicted based on classifications by PROMOTIF, and the two-class prediction of β-turn or not is a superset comprised of all β-turn types. The performance is evaluated using a golden set of non-homologous sequences known as BT426. Our two-class prediction method achieves a performance of: MCC = 0.50, Qtotal = 82.1%, sensitivity = 75.6%, PPV = 68.8% and AUC = 0.864. We have compared our performance to eleven other prediction methods that obtain Matthews correlation coefficients in the range of 0.17 – 0.47. For the type specific β-turn predictions, only type I and II can be predicted with reasonable Matthews correlation coefficients, where we obtain performance values of 0.36 and 0.31, respectively.ConclusionThe NetTurnP method has been implemented as a webserver, which is freely available at http://www.cbs.dtu.dk/services/NetTurnP/. NetTurnP is the only available webserver that allows submission of multiple sequences.

Highlights

  • The secondary structure of a protein can be classified as local structural elements of a-helices, b-strands and coil regions

  • Several second layer network setups were tested in order to find the architecture with the highest cross-validated Matthews correlation coefficient [42] (MCC) value based on training set sequences

  • NetTurnP BT426 7-fold is referring to a 7-fold cross-validation performed on the BT426 dataset

Read more

Summary

Introduction

The secondary structure of a protein can be classified as local structural elements of a-helices, b-strands and coil regions. The latter is often thought of as unstructured regions, but do contain ordered local structures such as a-turns, c-turns, d-turns, p-turns, b-turns, bulges and random coil structures [1,2]. A further classification can be made based on the backbone dihedral angles phi and psi. B-turn types are classified according to the dihedral angles (W and y) between amino acid residues i+1 and i+2 [3,4].

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call