Abstract

Recent developments in the analysis of amino acid covariation are leading to breakthroughs in protein structure prediction, protein design, and prediction of the interactome. It is assumed that observed patterns of covariation are caused by molecular coevolution, where substitutions at one site affect the evolutionary forces acting at neighboring sites. Our theoretical and empirical results cast doubt on this assumption. We demonstrate that the strongest coevolutionary signal is a decrease in evolutionary rate and that unfeasibly long times are required to produce coordinated substitutions. We find that covarying substitutions are mostly found on different branches of the phylogenetic tree, indicating that they are independent events that may or may not be attributable to coevolution. These observations undermine the hypothesis that molecular coevolution is the primary cause of the covariation signal. In contrast, we find that the pairs of residues with the strongest covariation signal tend to have low evolutionary rates, and that it is this low rate that gives rise to the covariation signal. Slowly evolving residue pairs are disproportionately located in the protein’s core, which explains covariation methods’ ability to detect pairs of residues that are close in three dimensions. These observations lead us to propose the “coevolution paradox”: The strength of coevolution required to cause coordinated changes means the evolutionary rate is so low that such changes are highly unlikely to occur. As modern covariation methods may lead to breakthroughs in structural genomics, it is critical to recognize their biases and limitations.

Highlights

  • In recent years, several new methods have been developed to study amino acid covariation within protein sequences (Marks et al 2011; Jones et al 2012; Kamisetty et al 2013)

  • Many covariation methods assume that both episodic coevolution and directional selection lead to correlated substitutions between pairs of residues on the same branch of a phylogenetic tree, so the ratio of double to single changes on individual branches may provide a meaningful measure of the strength of coevolution

  • We show that a range of different coevolutionary and independent evolutionary scenarios are indistinguishable from one another based solely on the observation of covariation

Read more

Summary

Introduction

Several new methods have been developed to study amino acid covariation within protein sequences (Marks et al 2011; Jones et al 2012; Kamisetty et al 2013) These methodological developments have led to a resurgence of interest in covariation methods, and the promise of wide application to problems as diverse as de novo protein structure prediction, analysis of protein complexes, and protein design (Durani and Magliery 2013; Reynolds et al 2013; Taylor et al 2013). These coevolutionary pressures arise from functional or structural selective pressures acting to maintain specific subsets of residues at those positions, and as such, coevolution can be considered analogous to epistasis between sites (Fitch and Markowitz 1970; Tufts et al 2015)

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call