Abstract

Alignments of orthologous protein sequences convey a complex picture. Some positions are utterly conserved whilst others have diverged to variable degrees. Amongst the latter, many are non-exchangeable between extant sequences. How do functionally critical and highly conserved residues diverge? Why and how did these exchanges become incompatible within contemporary sequences? Our model is phosphoglycerate kinase (PGK), where lysine 219 is an essential active-site residue completely conserved throughout Eukaryota and Bacteria, and serine is found only in archaeal PGKs. Contemporary sequences tested exhibited complete loss of function upon exchanges at 219. However, a directed evolution experiment revealed that two mutations were sufficient for human PGK to become functional with serine at position 219. These two mutations made position 219 permissive not only for serine and lysine, but also to a range of other amino acids seen in archaeal PGKs. The identified trajectories that enabled exchanges at 219 show marked sign epistasis - a relatively small loss of function with respect to one amino acid (lysine) versus a large gain with another (serine, and other amino acids). Our findings support the view that, as theoretically described, the trajectories underlining the divergence of critical positions are dominated by sign epistatic interactions. Such trajectories are an outcome of rare mutational combinations. Nonetheless, as suggested by the laboratory enabled K219S exchange, given enough time and variability in selection levels, even utterly conserved and functionally essential residues may change.

Highlights

  • Universally-spread proteins suggest a common ancestor dating back to the last universal common ancestor (LUCA),3.5 billion years ago [1,2,3]

  • All bacteria and eukaryotes have lysine at position 219, as does the inferred ancestor of all phosphoglycerate kinase (PGK)

  • Given its clear phylogenetic pattern, and the well-defined catalytic role of lysine 219 in human PGK, we focused on the exchange at position 219

Read more

Summary

Introduction

Universally-spread proteins suggest a common ancestor dating back to the last universal common ancestor (LUCA), ,3.5 billion years ago [1,2,3] They indicate that most positions have drifted from their original state, but many, typically active-site positions, remained unchanged. Sign epistasis relates to a qualitative rather than a quantitative effect, namely a mutation that is beneficial within one genetic background but deleterious at another This form of epistasis results in a very slow yet constant divergence rate, suggesting that protein sequences have not yet reached the limits of their divergence potential [6]. Protein sequence divergence and epistasis have been studied [4,10,13,14] It remains unclear, how sequence space is traversed at crucial active-site positions, in positions that have a key functional role and are highly conserved. The mechanism(s) leading to incompatibility — i.e., an originally tolerated exchange becoming unacceptable in orthologous sequences, are unclear

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call