Abstract

We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.

Highlights

  • The Gram-negative bacterium Escherichia coli has been foundational to our understanding of bacterial genetics since 1946, when Lederberg and Tatum first demonstrated bacterial conjugation [1]

  • ER2796 contains a total of 249 single-base changes relative to MG1655 (S1 Table), including the 21 that we propose result from 4 gene conversion events

  • Examination of the complete sequence of ER2796 revealed the occurrence of four gene conversion events between repeated sequences

Read more

Summary

Introduction

The Gram-negative bacterium Escherichia coli has been foundational to our understanding of bacterial genetics since 1946, when Lederberg and Tatum first demonstrated bacterial conjugation [1]. Between that time and the start of the genome sequence era one half century later, a wealth of E. coli genotypic and phenotypic information was generated through laboratory manipulation of strains. There are finished genome sequences available for more than 75 E. coli strains, but the vast majority of these are wild type isolates, both pathogenic and PLOS ONE | DOI:10.1371/journal.pone.0127446. Complete Genome Sequence of E. coli ER2796 these authors are articulated in the ‘author contributions’ section There are finished genome sequences available for more than 75 E. coli strains, but the vast majority of these are wild type isolates, both pathogenic and PLOS ONE | DOI:10.1371/journal.pone.0127446 May 26, 2015

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call