A four-column theory for the origin of the genetic code: tracing the evolutionary pathways that gave rise to an optimized code

Paul G Higgs

doi:10.1186/1745-6150-4-16

Abstract

BackgroundThe arrangement of the amino acids in the genetic code is such that neighbouring codons are assigned to amino acids with similar physical properties. Hence, the effects of translational error are minimized with respect to randomly reshuffled codes. Further inspection reveals that it is amino acids in the same column of the code (i.e. same second base) that are similar, whereas those in the same row show no particular similarity. We propose a 'four-column' theory for the origin of the code that explains how the action of selection during the build-up of the code leads to a final code that has the observed properties.ResultsThe theory makes the following propositions. (i) The earliest amino acids in the code were those that are easiest to synthesize non-biologically, namely Gly, Ala, Asp, Glu and Val. (ii) These amino acids are assigned to codons with G at first position. Therefore the first code may have used only these codons. (iii) The code rapidly developed into a four-column code where all codons in the same column coded for the same amino acid: NUN = Val, NCN = Ala, NAN = Asp and/or Glu, and NGN = Gly. (iv) Later amino acids were added sequentially to the code by a process of subdivision of codon blocks in which a subset of the codons assigned to an early amino acid were reassigned to a later amino acid. (v) Later amino acids were added into positions formerly occupied by amino acids with similar properties because this can occur with minimal disruption to the proteins already encoded by the earlier code. As a result, the properties of the amino acids in the final code retain a four-column pattern that is a relic of the earliest stages of code evolution.ConclusionThe driving force during this process is not the minimization of translational error, but positive selection for the increased diversity and functionality of the proteins that can be made with a larger amino acid alphabet. Nevertheless, the code that results is one in which translational error is minimized. We define a cost function with which we can compare the fitness of codes with varying numbers of amino acids, and a barrier function, which measures the change in cost immediately after addition of a new amino acid. We show that the barrier is positive if an amino acid is added into a column with dissimilar properties, but negative if an amino acid is added into a column with similar physical properties. Thus, natural selection favours the assignment of amino acids to the positions that they occupy in the final code.ReviewersThis article was reviewed by David Ardell, Eugene Koonin and Stephen Freeland (nominated by Laurence Hurst)

Highlights

The arrangement of the amino acids in the genetic code is such that neighbouring codons are assigned to amino acids with similar physical properties
The theory given here uses a formula for code cost that arises straightforwardly from previous cost formulae used for comparing random codes, but develops this into an evolutionary argument by which the pathways of code evolution can be predicted
We have improved on previous work by introducing a more realistic amino acid distance measure dW that is derived from maximum likelihood fitting of real protein sequence data

Summary

Introduction

The arrangement of the amino acids in the genetic code is such that neighbouring codons are assigned to amino acids with similar physical properties. It is well known that the arrangement of amino acids in the standard genetic code is distinctly non-random and is such that neighbouring codons (i.e. those that differ at only one of the three positions) are assigned to amino acids with similar physical properties. Given that naturally occurring protein sequences have already been selected for efficient function for long periods, most random amino acid changes introduced into a protein sequence are likely to be deleterious. A mutation at a single DNA site will cause an amino acid to be replaced by the amino acid assigned to the neighbouring codon; the arrangement of genetic code is such that mutations are likely to be less deleterious than in a random code. It appears that the standard code is optimized to reduce the effects of both translational error and deleterious mutations

Objectives

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biology Direct	Publication Date: Jan 1, 2009
Citations: 175	License type: cc-by

R Discovery Prime

R Discovery Prime

A four-column theory for the origin of the genetic code: tracing the evolutionary pathways that gave rise to an optimized code

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biology Direct

Lead the way for us

Similar Papers

Theories of the origin of the genetic code: Strong corroboration for the coevolution theory
Massimo Di Giulio
BioSystems | VOL. 239
Massimo Di GiulioMassimo Di Giulio
24 Apr 2024
BioSystems | VOL. 239

Early Changes in Plasma Amino Acid Concentrations during Aggressive Nutritional Therapy in Extremely Low Birth Weight Infants
Cynthia Liudmilla Blanco ... Edward A Liechty
The Journal of Pediatrics | VOL. 158
Cynthia Liudmilla Blanco, et. al.Cynthia Liudmilla Blanco ... Edward A Liechty
03 Dec 2010
The Journal of Pediatrics | VOL. 158

An extension of the coevolution theory of the origin of the genetic code.
Massimo Di Giulio
Biology Direct | VOL. 3
Massimo Di GiulioMassimo Di Giulio
01 Jan 2008
Biology Direct | VOL. 3

The optimality of the standard genetic code assessed by an eight-objective evolutionary algorithm
Małgorzata Wnętrzak ... Paweł Błażej
BMC Evolutionary Biology | VOL. 18
Małgorzata Wnętrzak, et. al.Małgorzata Wnętrzak ... Paweł Błażej
01 Dec 2018
BMC Evolutionary Biology | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A four-column theory for the origin of the genetic code: tracing the evolutionary pathways that gave rise to an optimized code

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biology Direct