Protein family comparison using statistical models and predicted structural information

Richard Chung,Golan Yona

doi:10.1186/1471-2105-5-183

Abstract

BackgroundThis paper presents a simple method to increase the sensitivity of protein family comparisons by incorporating secondary structure (SS) information. We build upon the effective information theory approach towards profile-profile comparison described in [Yona & Levitt 2002]. Our method augments profile columns using PSIPRED secondary structure predictions and assesses statistical similarity using information theoretical principles.ResultsOur tests show that this tool detects more similarities between protein families of distant homology than the previous primary sequence-based method. A very significant improvement in performance is observed when the real secondary structure is used.ConclusionsIntegration of primary and secondary structure information can substantially improve detection of relationships between remotely related protein families.

Highlights

This paper presents a simple method to increase the sensitivity of protein family comparisons by incorporating secondary structure (SS) information
Our method extends our previous work on profile-profile comparison [19]
Data sets We use a data set of domain families derived from the SCOP classification of protein structures [20], release 1.50

Summary

Methodology article

Protein family comparison using statistical models and predicted structural information. Address: Department of Computer Science, Cornell University, Ithaca, NY 14850, USA. Published: 25 November 2004 BMC Bioinformatics 2004, 5:183 doi:10.1186/1471-2105-5-183

Background

Methods and Results

Discussion

74 KKEGAKLQEVVLYQF 88

84 LQSEHAKVHSFHDYELQYSALNHTTTLFVDGQQITTWAGEVSQ 126

Conclusion

Murzin AG

Pearson WR

13. Jones DT

27. Kullback S

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jan 1, 2004
Citations: 33	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Protein family comparison using statistical models and predicted structural information

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology
Jb Brown ... Tatsuya Akutsu
BMC Bioinformatics | VOL. 10
Jb Brown, et. al.Jb Brown ... Tatsuya Akutsu
20 Jan 2009
BMC Bioinformatics | VOL. 10

Protein Multiple Alignment Incorporating Primary and Secondary Structure Information
Nak-Kyeong Kim ... Jun Xie
Journal of Computational Biology | VOL. 13
Nak-Kyeong Kim, et. al.Nak-Kyeong Kim ... Jun Xie
01 Nov 2006
Journal of Computational Biology | VOL. 13

Protein Multiple Alignment Incorporating Primary and Secondary Structure Information
Nak-Kyeong Kim ... Jun Xie
Journal of Computational Biology | VOL. 13
Nak-Kyeong Kim, et. al.Nak-Kyeong Kim ... Jun Xie
01 Dec 2006
Journal of Computational Biology | VOL. 13

The analysis of rRNA sequence-structure in phylogenetics: An application to the family Pectinidae (Mollusca: Bivalvia)
Daniele Salvi ... Paolo Mariottini
Molecular Phylogenetics and Evolution | VOL. 56
Daniele Salvi, et. al.Daniele Salvi ... Paolo Mariottini
21 Apr 2010
Molecular Phylogenetics and Evolution | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Protein family comparison using statistical models and predicted structural information

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics