Structural Alphabets for Protein Structure Classification: A Comparison Study

Quan Le,Gianluca Pollastri,Patrice Koehl

doi:10.1016/j.jmb.2008.12.044

Abstract

Finding structural similarities between proteins often helps reveal shared functionality, which otherwise might not be detected by native sequence information alone. Such similarity is usually detected and quantified by protein structure alignment. Determining the optimal alignment between two protein structures, however, remains a hard problem. An alternative approach is to approximate each three-dimensional protein structure using a sequence of motifs derived from a structural alphabet. Using this approach, structure comparison is performed by comparing the corresponding motif sequences or structural sequences. In this article, we measure the performance of such alphabets in the context of the protein structure classification problem. We consider both local and global structural sequences. Each letter of a local structural sequence corresponds to the best matching fragment to the corresponding local segment of the protein structure. The global structural sequence is designed to generate the best possible complete chain that matches the full protein structure. We use an alphabet of 20 letters, corresponding to a library of 20 motifs or protein fragments having four residues. We show that the global structural sequences approximate well the native structures of proteins, with an average coordinate root mean square of 0.69 Å over 2225 test proteins. The approximation is best for all α-proteins, while relatively poorer for all β-proteins. We then test the performance of four different sequence representations of proteins (their native sequence, the sequence of their secondary-structure elements, and the local and global structural sequences based on our fragment library) with different classifiers in their ability to classify proteins that belong to five distinct folds of CATH. Without surprise, the primary sequence alone performs poorly as a structure classifier. We show that addition of either secondary-structure information or local information from the structural sequence considerably improves the classification accuracy. The two fragment-based sequences perform better than the secondary-structure sequence but not well enough at this stage to be a viable alternative to more computationally intensive methods based on protein structure alignment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Structural Alphabets for Protein Structure Classification: A Comparison Study

Abstract

Talk to us

Similar Papers

More From: Journal of Molecular Biology

Lead the way for us

Journal: Journal of Molecular Biology	Publication Date: Dec 25, 2008
Citations: 36

Similar Papers

Protein structure comparison using bipartite graph matching and its application to protein structure classification.
William R Taylor
Molecular & cellular proteomics : MCP | VOL. 1
William R TaylorWilliam R Taylor
04 Mar 2002
Molecular & cellular proteomics : MCP | VOL. 1

Inverse Kinematics in Biology: The Protein Loop Closure Problem
Rachel Kolodny ... Patrice Koehl
The International Journal of Robotics Research | VOL. 24
Rachel Kolodny, et. al.Rachel Kolodny ... Patrice Koehl
01 Feb 2005
The International Journal of Robotics Research | VOL. 24

Improvement of protein structure comparison using a structural alphabet
Agnel Praveen Joseph ... Alexandre G De Brevern
Biochimie | VOL. 93
Agnel Praveen Joseph, et. al.Agnel Praveen Joseph ... Alexandre G De Brevern
05 May 2011
Biochimie | VOL. 93

DoSA: Database of Structural Alignments
S Mahajan ... A G De Brevern
Database | VOL. 2013
S Mahajan, et. al.S Mahajan ... A G De Brevern
11 Jul 2013
Database | VOL. 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structural Alphabets for Protein Structure Classification: A Comparison Study

Abstract

Talk to us

Similar Papers

More From: Journal of Molecular Biology