Clustering of Protein Structural Fragments Reveals Modular Building Block Approach of Nature

Ashish V Tendulkar,Anand A Joshi,Milind A Sohoni,Pramod P Wangikar

doi:10.1016/j.jmb.2004.02.047

Ashish V Tendulkar, Anand A Joshi + Show 2 more

Open Access

https://doi.org/10.1016/j.jmb.2004.02.047

Copy DOI

Abstract

Structures of peptide fragments drawn from a protein can potentially occupy a vast conformational continuum. We co-ordinatize this conformational space with the help of geometric invariants and demonstrate that the peptide conformations of the currently available protein structures are heavily biased in favor of a finite number of conformational types or structural building blocks. This is achieved by representing a peptides' backbone structure with geometric invariants and then clustering peptides based on closeness of the geometric invariants. This results in 12,903 clusters, of which 2207 are made up of peptides drawn from functionally and/or structurally related proteins. These are termed “functional” clusters and provide clues about potential functional sites. The rest of the clusters, including the largest few, are made up of peptides drawn from unrelated proteins and are termed “structural” clusters. The largest clusters are of regular secondary structures such as helices and beta strands as well as of beta hairpins. Several categories of helices and strands are discovered based on geometric differences. In addition to the known classes of loops, we discover several new classes, which will be useful in protein structure modeling. Our algorithm does not require assignment of secondary structure and, therefore, overcomes the limitations in loop classification due to ambiguity in secondary structure assignment at loop boundaries.

Full Text