Abstract

BackgroundProkaryotes have relatively small genomes, densely-packed with protein-encoding sequences. RNA sequencing has, however, revealed surprisingly complex transcriptomes and here we report the transcripts present in the model hyperthermophilic Archaeon, Thermococcus kodakarensis, under different physiological conditions.ResultsSequencing cDNA libraries, generated from RNA isolated from cells under different growth and metabolic conditions has identified >2,700 sites of transcription initiation, established a genome-wide map of transcripts, and consensus sequences for transcription initiation and post-transcription regulatory elements. The primary transcription start sites (TSS) upstream of 1,254 annotated genes, plus 644 primary TSS and their promoters within genes, are identified. Most mRNAs have a 5'-untranslated region (5'-UTR) 10 to 50 nt long (median = 16 nt), but ~20% have 5'-UTRs from 50 to 300 nt long and ~14% are leaderless. Approximately 50% of mRNAs contain a consensus ribosome binding sequence. The results identify TSS for 1,018 antisense transcripts, most with sequences complementary to either the 5'- or 3'-region of a sense mRNA, and confirm the presence of transcripts from all three CRISPR loci, the RNase P and 7S RNAs, all tRNAs and rRNAs and 69 predicted snoRNAs. Two putative riboswitch RNAs were present in growing but not in stationary phase cells. The procedure used is designed to identify TSS but, assuming that the number of cDNA reads correlates with transcript abundance, the results also provide a semi-quantitative documentation of the differences in T. kodakarensis genome expression under different growth conditions and confirm previous observations of substrate-dependent specific gene expression. Many previously unanticipated small RNAs have been identified, some with relative low GC contents (≤50%) and sequences that do not fold readily into base-paired secondary structures, contrary to the classical expectations for non-coding RNAs in a hyperthermophile.ConclusionThe results identify >2,700 TSS, including almost all of the primary sites of transcription initiation upstream of annotated genes, plus many secondary sites, sites within genes and sites resulting in antisense transcripts. The T. kodakarensis genome is small (~2.1 Mbp) and tightly packed with protein-encoding genes, but the transcriptomes established also contain many non-coding RNAs and predict extensive RNA-based regulation in this model Archaeon.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-684) contains supplementary material, which is available to authorized users.

Highlights

  • Prokaryotes have relatively small genomes, densely-packed with protein-encoding sequences

  • T. kodakarensis transcripts and transcription start sites (TSS) Identifying transcripts and precisely mapping transcription start sites (TSS) refines and extends genome annotation and the discovery of regulatory elements that control gene expression. To obtain this knowledge for T. kodakarensis, we employed differential high-throughput sequencing of cDNA libraries generated from RNA preparations isolated from cells grown with and without sulfur, growth conditions that result in different patterns of gene expression and metabolism [32,33,34,35]

  • CDNA libraries were generated and sequenced from RNA isolated from T. kodakarensis cells growing exponentially (Sexp) and to stationary phase (Sstat) in ASW-YT medium with sulfur, growing exponentially in ASW-YT with pyruvate (Pexp), and from cells growing exponentially in pyruvate but 20 min after sulfur addition (PS)

Read more

Summary

Introduction

Prokaryotes have relatively small genomes, densely-packed with protein-encoding sequences. By using high-throughput RNA sequencing (RNA-seq), it is possible to identify essentially all transcripts present in cells [4,5] and such studies have revealed surprisingly complex transcriptomes in Bacteria, with many previously unanticipated non-coding small (sRNA) and antisense RNAs [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20] To add to this database, and to add to the relatively few RNA-seq studies reported to date for Archaea [21,22,23,24,25,26,27,28,29,30,31], we have used differential RNA-seq technology (dRNA-seq) to identify the transcripts present in T. kodakarensis cells growing on different substrates and in stationary-phase cells. Based on the conservation of sequences upstream of the TSS identified, consensus sequences have been identified for the core elements of T. kodakarensis promoters from which the synthesis of primary, secondary, internal and antisense transcripts is initiated

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.