Abstract

BackgroundBecause loops connect regular secondary structures, analysis of the former depends directly on the definition of the latter. The numerous assignment methods, however, can offer different definitions. In a previous study, we defined a structural alphabet composed of 16 average protein fragments, which we called Protein Blocks (PBs). They allow an accurate description of every region of 3D protein backbones and have been used in local structure prediction. In the present study, we use this structural alphabet to analyze and predict the loops connecting two repetitive structures.ResultsWe first analyzed the secondary structure assignments. Use of five different assignment methods (DSSP, DEFINE, PCURVE, STRIDE and PSEA) showed the absence of consensus: 20% of the residues were assigned to different states. The discrepancies were particularly important at the extremities of the repetitive structures. We used PBs to describe and predict the short loops because they can help analyze and in part explain these discrepancies. An analysis of the PB distribution in these regions showed some specificities in the sequence-structure relationship. Of the amino acid over- or under-representations observed in the short loop databank, 20% did not appear in the entire databank. Finally, predicting 3D structure in terms of PBs with a Bayesian approach yielded an accuracy rate of 36.0% for all loops and 41.2% for the short loops. Specific learning in the short loops increased the latter by 1%.ConclusionThis work highlights the difficulties of assigning repetitive structures and the advantages of using more precise descriptions, that is, PBs. We observed some new amino acid distributions in the short loops and used this information to enhance local prediction. Instead of describing entire loops, our approach predicts each position in the loops locally. It can thus be used to propose many different structures for the loops and to probe and sample their flexibility. It can be a useful tool in ab initio loop prediction.

Highlights

  • Because loops connect regular secondary structures, analysis of the former depends directly on the definition of the latter

  • We focused on the frequency of occurrence of Protein Blocks (PBs) in these regions and on the main transitions between successive PBs, since previous studies observed only a limited number of transitions [19,23]

  • The prediction gives a probability for every PB at each position

Read more

Summary

Introduction

Because loops connect regular secondary structures, analysis of the former depends directly on the definition of the latter. We defined a structural alphabet composed of 16 average protein fragments, which we called Protein Blocks (PBs) They allow an accurate description of every region of 3D protein backbones and have been used in local structure prediction. Since the first descriptions of protein structures by Pauling and Corey [1,2], their repetitive secondary structures have been widely analyzed They have been studied from two principal points of view – assignment and prediction. PSEA [7] bases its assignments only on the Cα position, using distance and angle criteria Not surprisingly, these methods do not assign the same state to all residues, especially those located at the beginning and end of repetitive structures. DSSP, DEFINE and PCURVE only assign 65% of residues to the same state [8]

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.