Abstract

BackgroundProteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem.ResultsIt was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation.ConclusionsThe results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.Electronic supplementary materialThe online version of this article (doi:10.1186/1756-0500-7-654) contains supplementary material, which is available to authorized users.

Highlights

  • Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study

  • In molecular bioinformatics, elucidating how a protein folds into its native structure is a significant unsolved problem that is related to the modelling and design of new protein 3D structures

  • We examine whether the inter-residue average distance statistics can be used to extract new information on 3D structures from these sequences in addition to the information gained from the standard sequence analysis techniques

Read more

Summary

Introduction

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In molecular bioinformatics, elucidating how a protein folds into its native structure is a significant unsolved problem that is related to the modelling and design of new protein 3D structures To address this problem, we have to understand the relationship between the amino acid sequence and the 3D structure of a protein. Discriminating between two alternative structures with a very high sequence identity on the basis of energetics simulations is considered difficult [5]

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.