Highly Sensitive Detection of Individual HEAT and ARM Repeats with HHpred and COACH

Fred Kippert,Dietlind L Gerloff

doi:10.1371/journal.pone.0007148

Abstract

BackgroundHEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat.Methodology and Principal FindingsTesting algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries.SignificanceA protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high sensitivity at a low false positive rate and will therefore greatly enhance the accuracy of predictions of HEAT and ARM domains.

Highlights

Internal tandem duplications have played an important role in protein evolution
We have developed a new protocol for the detection of HEAT and ARM repeats
Since Andrade et al.’s systematic study [4], a considerable number of additional structures have been deposited in the PDB database for which an association with HEAT/ARM has been made by the authors

Summary

Introduction

Internal tandem duplications have played an important role in protein evolution. Multiple duplications of segments 30 to 50 residues in length have been successful as judged from their spread in the eukaryotic cell [1,2]. In this paper we will i) calibrate HHpred and COACH and define confidence score ranges, ii) evaluate the sensitivity of our method with reference to all available HEAT/ARM structures, iii) investigate the occurrence of false positives, iv) apply the method to candidate proteins, and v) discuss potential limitations of the method when dealing with highly divergent family members.

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: Sep 24, 2009
Citations: 60	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Highly Sensitive Detection of Individual HEAT and ARM Repeats with HHpred and COACH

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Author response: Structural basis for SARM1 inhibition and activation under energetic stress
Michael Sporny ... Carsten Mim
-
Michael Sporny, et. al.Michael Sporny ... Carsten Mim
21 Oct 2020
21 Oct 2020

Structural and Functional Relationships between the Lectin and Arm Domains of Calreticulin
Cosmin L Pocanschi ... Kalle Gehring
Journal of Biological Chemistry | VOL. 286
Cosmin L Pocanschi, et. al.Cosmin L Pocanschi ... Kalle Gehring
01 Aug 2011
Journal of Biological Chemistry | VOL. 286

The C-terminal domain of armadillo binds to hypophosphorylated teashirt to modulate wingless signalling in Drosophila.
A Gallet
The EMBO Journal | VOL. 18
A GalletA Gallet
15 Apr 1999
The EMBO Journal | VOL. 18

The adoption of a twisted structure of importin-β is essential for the protein-protein interaction required for nuclear transport
Soo Jae Lee ... Tomitake Tsukihara
Journal of Molecular Biology | VOL. 302
Soo Jae Lee, et. al.Soo Jae Lee ... Tomitake Tsukihara
01 Sep 2000
Journal of Molecular Biology | VOL. 302

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Highly Sensitive Detection of Individual HEAT and ARM Repeats with HHpred and COACH

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE