Model Of Protein Evolution Research Articles

BackgroundThe reaction of HIV protease to inhibitor therapy is characterized by the emergence of complex mutational patterns which confer drug resistance. The response of HIV protease to drugs often involves both primary mutations that directly inhibit the action of the drug, and a host of accessory resistance mutations that may occur far from the active site but may contribute to restoring the fitness or stability of the enzyme. Here we develop a probabilistic approach based on connected information that allows us to study residue, pair level and higher-order correlations within the same framework.ResultsWe apply our methodology to a database of approximately 13,000 sequences which have been annotated by the treatment history of the patients from which the samples were obtained. We show that including pair interactions is essential for agreement with the mutational data, since neglect of these interactions results in order-of-magnitude errors in the probabilities of the simultaneous occurence of many mutations. The magnitude of these pair correlations changes dramatically between sequences obtained from patients that were or were not exposed to drugs. Higher-order effects make a contribution of as much as 10% for residues taken three at a time, but increase to more than twice that for 10 to 15-residue groups. The sequence data is insufficient to determine the higher-order effects for larger groups. We find that higher-order interactions have a significant effect on the predicted frequencies of sequences with large numbers of mutations. While relatively rare, such sequences are more prevalent after multi-drug therapy. The relative importance of these higher-order interactions increases with the number of drugs the patient had been exposed to.ConclusionCorrelations are critical for the understanding of mutation patterns in HIV protease. Pair interactions have substantial qualitative effects, while higher-order interactions are individually smaller but may have a collective effect. Together they lead to correlations which could have an important impact on the dynamics of the evolution of cross-resistance, by allowing the virus to pass through otherwise unlikely mutational states. These findings also indicate that pairwise and possibly higher-order effects should be included in the models of protein evolution, instead of assuming that all residues mutate independently of one another.

The complexity of protein structures calls for simplified representations of their topology. The simplest possible mathematical description of a protein structure is a one-dimensional profile representing, for instance, buriedness or secondary structure. This kind of representation has been introduced for studying the sequence to structure relationship, with applications to fold recognition. Here we define the effective connectivity profile (EC), a network theoretical profile that self-consistently represents the network structure of the protein contact matrix. The EC profile makes mathematically explicit the relationship between protein structure and protein sequence, because it allows predicting the average hydrophobicity profile (HP) and the distributions of amino acids at each site for families of homologous proteins sharing the same structure. In this sense, the EC provides an analytic solution to the statistical inverse folding problem, which consists in finding the statistical properties of the set of sequences compatible with a given structure. We tested these predictions with simulations of the structurally constrained neutral (SCN) model of protein evolution with structure conservation, for single- and multi-domain proteins, and for a wide range of mutation processes, the latter producing sequences with very different hydrophobicity profiles, finding that the EC-based predictions are accurate even when only one sequence of the family is known. The EC profile is very significantly correlated with the HP for sequence-structure pairs in the PDB as well. The EC profile generalizes the properties of previously introduced structural profiles to modular proteins such as multidomain chains, and its correlation with the sequence profile is substantially improved with respect to the previously defined profiles, particularly for long proteins. Furthermore, the EC profile has a dynamic interpretation, since the EC components are strongly inversely related with the temperature factors measured in X-ray experiments, meaning that positions with large EC component are more strongly constrained in their equilibrium dynamics. Last, the EC profile allows to define a natural measure of modularity that correlates with the number of domains composing the protein, suggesting its application for domain decomposition. Finally, we show that structurally similar proteins have similar EC profiles, so that the similarity between aligned EC profiles can be used as a structure similarity measure, a property that we have recently applied for protein structure alignment. The code for computing the EC profile is available upon request writing to ubastolla@cbm.uam.es, and the structural profiles discussed in this article can be downloaded from the SLOTH webserver http://www.fkp.tu-darmstadt.de/SLOTH/.

Model Of Protein Evolution Research Articles

Articles published on Model Of Protein Evolution

What's in a Likelihood? Simple Models of Protein Evolution and the Contribution of Structurally Viable Reconstructions to the Likelihood

Estudio computacional de las relaciones evolutivas de los receptores ionotrópicos NMDA, AMPA y kainato en cuatro especies de primates

Comparing Models of Evolution for Ordered and Disordered Proteins

Pairwise and higher-order correlations among drug-resistance mutations in HIV-1 subtype B protease

PROCOV: maximum likelihood estimation of protein phylogeny under covarion models and site-specific covarion pattern analysis

Neutral evolution of proteins: The superfunnel in sequence space and its relation to mutational robustness

The Universal Trend of Amino Acid Gain–Loss is Caused by CpG Hypermutability

Frequent and Widespread Parallel Evolution of Protein Sequences

Effective connectivity profile: A structural representation that evidences the relationship between protein structures and sequences

A Test of the Markov Model of Evolution in Proteins

Protein Interaction Network. Double Exponential Model

The look-ahead effect of phenotypic mutations

Evolutionary framework for protein sequence evolution and gene pleiotropy.

Neutral evolution of protein-protein interactions: a computational study using simple models

Neighboring-site effects of amino acid mutation

Quaternary Structure Constraints on Evolutionary Sequence Divergence

Assessing Site-Interdependent Phylogenetic Models of Sequence Evolution

Testing for Spatial Clustering of Amino Acid Replacements Within Protein Tertiary Structure

Molecular Evolution of the Plant Virus Family Bromoviridae Based on RNA3-Encoded Proteins

Glassy dynamics in the adaptive immune response prevents autoimmune disease.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Model Of Protein Evolution Research Articles

Articles published on Model Of Protein Evolution

What's in a Likelihood? Simple Models of Protein Evolution and the Contribution of Structurally Viable Reconstructions to the Likelihood

Estudio computacional de las relaciones evolutivas de los receptores ionotrópicos NMDA, AMPA y kainato en cuatro especies de primates

Comparing Models of Evolution for Ordered and Disordered Proteins

Pairwise and higher-order correlations among drug-resistance mutations in HIV-1 subtype B protease

PROCOV: maximum likelihood estimation of protein phylogeny under covarion models and site-specific covarion pattern analysis

Neutral evolution of proteins: The superfunnel in sequence space and its relation to mutational robustness

The Universal Trend of Amino Acid Gain–Loss is Caused by CpG Hypermutability

Frequent and Widespread Parallel Evolution of Protein Sequences

Effective connectivity profile: A structural representation that evidences the relationship between protein structures and sequences

A Test of the Markov Model of Evolution in Proteins

Protein Interaction Network. Double Exponential Model

The look-ahead effect of phenotypic mutations

Evolutionary framework for protein sequence evolution and gene pleiotropy.

Neutral evolution of protein-protein interactions: a computational study using simple models

Neighboring-site effects of amino acid mutation

Quaternary Structure Constraints on Evolutionary Sequence Divergence

Assessing Site-Interdependent Phylogenetic Models of Sequence Evolution

Testing for Spatial Clustering of Amino Acid Replacements Within Protein Tertiary Structure

Molecular Evolution of the Plant Virus Family Bromoviridae Based on RNA3-Encoded Proteins

Glassy dynamics in the adaptive immune response prevents autoimmune disease.