Number Of Secondary Structure Elements Research Articles

We present fast and simple-to-implement measures of the entanglement of protein tertiary structures which are appropriate for highly flexible structure comparison. These are performed using the SKMT algorithm, a novel method of smoothing the Cα backbone to achieve a minimal complexity curve representation of the manner in which the protein's secondary structure elements fold to form its tertiary structure. Its subsequent complexity is characterised using measures based on the writhe and crossing number quantities heavily utilised in DNA topology studies, and which have shown promising results when applied to proteins recently. The SKMT smoothing is used to derive empirical bounds on a protein's entanglement relative to its number of secondary structure elements. We show that large scale helical geometries dominantly account for the maximum growth in entanglement of protein monomers, and further that this large scale helical geometry is present in a large array of proteins, consistent across a number of different protein structure types and sequences. We also show how these bounds can be used to constrain the search space of protein structure prediction from small angle x-ray scattering experiments, a method highly suited to determining the likely structure of proteins in solution where crystal structure or machine learning based predictions often fail to match experimental data. Finally we develop a structural comparison metric based on the SKMT smoothing which is used in one specific case to demonstrate significant structural similarity between Rossmann fold and TIM Barrel proteins, a link which is potentially significant as attempts to engineer the latter have in the past produced the former. We provide the SWRITHE interactive python notebook to calculate these metrics.

Read full abstract

Shape similarity is one of the most elusive and intriguing questions of nature and mathematics. Proteins provide a rich domain in which to test theories of shape similarity. Proteins can match at different scales and in different arrangements. Sometimes the detection of common local structure is sufficient to infer global alignment of two proteins; at other times it provides false information. Proteins with very low sequence identity may share large substructures, or perhaps just a central core. There are even examples of proteins with nearly identical primary sequences in which alpha-helices have become beta-sheets. Shape similarity can be formulated (i) in terms of global metrics, such as RMSD or Hausdorff distance, (ii) in terms of subgraph isomorphisms, such as the detection of shared substructures with similar relative locations, or (iii) purely topologically, in terms of structure preserving transformations. Existing protein structure detection programs are built on the first two types of similarity. The third forms the foundations of knot theory. The thesis of this paper is this: Protein similarity detection leads naturally to algorithms operating at the metric, relational, and isotopic scales. The paper introduces a definition of similarity based on atomic motions that preserve local backbone topology without incurring significant distance errors. Such motions are motivated by the physical requirements for rearranging subsequences of a protein. Similarity detection then seeks rigid body motions able to overlay pairs of substructures, each related by a substructure-preserving motion, without necessarily requiring global structure preservation. This definition is general enough to span a wide range of questions: One can ask for full rearrangement of one protein into another while preserving global topology, as in drug design; or one can ask for rearrangements of sets of smaller substructures, preserving local but not global topology, as in protein evolution. In the appendix, we exhibit an algorithm for answering the general rearrangement question. That algorithm has the complexity of robot motion planning. In the text, we consider a more common case in which one seeks protein similarity by rearrangements of relatively short peptide segments. We exhibit two algorithms, one based on writhing numbers and one based on line weavings. The algorithms have time complexities O(n (4)) and O(s (11)), respectively, where n is the maximum number of residues in the proteins being compared and s is the number of secondary structure elements. In practice, the running times were nearly interactive. We report results obtained with a dozen pairs of proteins, exhibiting a range of typical features.

Read full abstract

Number Of Secondary Structure Elements Research Articles

Related Topics

Articles published on Number Of Secondary Structure Elements

The SKMT Algorithm: A method for assessing and comparing underlying protein entanglement.

Terahertz Spectral Domain Computational Analysis of Hydration Shell of Proteins with Increasingly Complex Tertiary Structure

Automatic structure classification of small proteins using random forest

Exploring protein structural dissimilarity to facilitate structure classification.

Symmetric Connectivity of Secondary Structure Elements Enhances the Diversity of Folding Pathways

Protein Similarity from Knot Theory: Geometric Convolution and Line Weavings

Analysis of fragments induced by simulated lattice protein folding

A coarse-grained, “realistic” model for protein folding

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Number Of Secondary Structure Elements Research Articles

Related Topics

Articles published on Number Of Secondary Structure Elements

The SKMT Algorithm: A method for assessing and comparing underlying protein entanglement.

Terahertz Spectral Domain Computational Analysis of Hydration Shell of Proteins with Increasingly Complex Tertiary Structure

Automatic structure classification of small proteins using random forest

Exploring protein structural dissimilarity to facilitate structure classification.

Symmetric Connectivity of Secondary Structure Elements Enhances the Diversity of Folding Pathways

Protein Similarity from Knot Theory: Geometric Convolution and Line Weavings

Analysis of fragments induced by simulated lattice protein folding

A coarse-grained, “realistic” model for protein folding