Homology Inference Research Articles

BackgroundInference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database.ResultsThe general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ~95% accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download.ConclusionThe SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.

Recognition of structural similarity in proteins invites the inference of homology even when the amino acid sequences are not highly similar. The influence of structural similarity on both the genetic tests for amino acid sequence similarity and the inference of homology was examined by statistical methods. Structure-dependent compositions of amino acid sequences representing segments of secondary and supersecondary structure were examined for the preferential occurrence of structurally similar amino acids that are also similar according to the genetic criteria of Fitch (1970), Dayhoff (1979) and McLachlan (1971). These analyses revealed that: (1) the preferential occurrence of structurally similar amino acids in analogous secondary structures should not give rise to a statistically significant sequence similarity; (2) some positional amino acid preferences in secondary and supersecondary structures score highly in sequence similarity tests; (3) the preferential occurrence of non-polar, β-branched amino acids in the parallel β-pleated sheets of α β proteins constitutes an important structural bias in genetic tests for sequence similarity. These findings may be applied to sequence comparisons whenever the conformational states are either known experimentally or may be inferred from predictive analysis of the amino acid sequence. The corrections for structure-dependent compositions were made in a search for homology between the two structurally similar, globular domains of bovine liver rhodanese. Despite these corrections and earlier failures to observe significant sequence similarity, a statistically significant sequence similarity was detected, supporting the inference that the domains are internally paralogous, i.e. intraspecies products of a partial internal duplication of an ancestral gene.

Homology Inference Research Articles

Related Topics

Articles published on Homology Inference

COMPASS server for remote homology inference

A streptococcal protease that degrades CXC chemokines and impairs bacterial clearance from infected tissues

Hox genes, homology and axis formation—The application of morphological concepts to evolutionary developmental biology

Terminal addition, the Cambrian radiation and the Phanerozoic evolution of bilaterian form

Statistical distributions of optimal global alignment scores of random protein sequences

FSSA: a novel method for identifying functional signatures from structural alignments

SCOPmap: Automated assignment of protein structures to evolutionary superfamilies

Preface

The Science of Phylogenetic Systematics: Explanation, Prediction, and Test

The Science of Phylogenetic Systematics: Explanation, Prediction, and Test.

FORMATION AND HOMOLOGY OF RADULAR TEETH; A CASE STUDY USING COLUMBELLID GASTROPODS (NEOGASTROPODA: COLUMBELLIDAE)

Epistemología de la investigación taxonómica: inferencias filogenéticas y su evaluación

Biochemistry and genetics of monoamine oxidase

An examination of the expected degree of sequence similarity that might arise in proteins that have converged to similar conformational states: The impact of such expectations on the search for homology between the structurally similar domains of rhodanese

A biological homology inference from ergodic theory

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Homology Inference Research Articles

Related Topics

Articles published on Homology Inference

COMPASS server for remote homology inference

A streptococcal protease that degrades CXC chemokines and impairs bacterial clearance from infected tissues

Hox genes, homology and axis formation—The application of morphological concepts to evolutionary developmental biology

Terminal addition, the Cambrian radiation and the Phanerozoic evolution of bilaterian form

Statistical distributions of optimal global alignment scores of random protein sequences

FSSA: a novel method for identifying functional signatures from structural alignments

SCOPmap: Automated assignment of protein structures to evolutionary superfamilies

Preface

The Science of Phylogenetic Systematics: Explanation, Prediction, and Test

The Science of Phylogenetic Systematics: Explanation, Prediction, and Test.

FORMATION AND HOMOLOGY OF RADULAR TEETH; A CASE STUDY USING COLUMBELLID GASTROPODS (NEOGASTROPODA: COLUMBELLIDAE)

Epistemología de la investigación taxonómica: inferencias filogenéticas y su evaluación

Biochemistry and genetics of monoamine oxidase

An examination of the expected degree of sequence similarity that might arise in proteins that have converged to similar conformational states: The impact of such expectations on the search for homology between the structurally similar domains of rhodanese

A biological homology inference from ergodic theory