Abstract

Each living species carries a complex DNA sequence that determines their unique features and functionalities. It is generally assumed that life started from a random pool of oligonucleotides sequences, generated by a prebiotic polymerization of nucleotides. The mechanism that initially facilitated the emergence of sequences that code for the function of the first species from such a random pool of sequences remains unknown. It is a central problem of the origin of life. An interesting option would be a self-selection mechanism by spontaneous symmetry breaking. Initial concentration fluctuations of specific sequence motifs would have been amplified and outcompeted less abundant sequences, enhancing the signal to noise to replicate and select functional sequences. Here, we demonstrate with experimental and theoretical findings that templated ligation would provide such a self-selection. In templated ligation, two adjacent single sequences strands are chemically joined when a third complementary strand sequence brought them in close proximity. This simple mechanism was a likely side-product of a prebiotic polymerization chemistry once the strands reach the length to form double stranded species. As shown here, the ligation gave rise to a nonlinear replication process by the cooperative ligation of matching sequences which self-promoted their own elongation. This led to a cascade of enhanced template binding and faster ligation reactions. A requirement was the reshuffling of the strands by thermal cycling, enabled for example by microscale convection. Assuming that templated ligation was driven by the same chemical mechanism that generated prebiotic polymerization of oligonucleotides, the mechanism could function as a missing link between polymerization and the self-stabilized replication, offering a pathway to the autonomous emergence of Darwinian evolution for the origin of life.

Highlights

  • The genetic information of present-day living species is encoded in the DNA sequence as a specific combination of four different nucleotides: A, C, G, and T

  • By using a limited initial sequence space and performing long-term ligations, we find that complementary sequences with an initially higher concentration prevail over either noncomplementary or less-concentrated sequences

  • We study how, under the replication dynamics of templated ligation, sets of similar sequences with high concentrations could survive by replication while less-concentrated or uncorrelated sequences die out

Read more

Summary

Introduction

The genetic information of present-day living species is encoded in the DNA sequence as a specific combination of four different nucleotides: A, C, G, and T. Any sequence space of even a moderate length of, for example, 25 bases is so large (425 ≈ 1015 ) that, even with a significant volume and concentration, the sampling can only be sparse, meaning that each molecule would have a different sequence. It must be expected that, even if such a very short sequence would have encoded and conferred an advantageous function for molecular evolution, it would not have had an impact on such a random pool of sequences. This limited sampling of sequence space is even the case for the most sophisticated systematic

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call