Ribosomal Protein Cluster Organization in Asgard Archaea.
It has been proposed that the superphylum of Asgard Archaea may represent a historical link between the Archaea and Eukarya. Following the discovery of the Archaea, it was soon appreciated that archaeal ribosomes were more similar to those of Eukarya rather than Bacteria. Coupled with other eukaryotic-like features, it has been suggested that the Asgard Archaea may be directly linked to eukaryotes. However, the genomes of Bacteria and non-Asgard Archaea generally organize ribosome-related genes into clusters that likely function as operons. In contrast, eukaryotes typically do not employ an operon strategy. To gain further insight into conservation of the r-protein genes, the genome order of conserved ribosomal protein (r-protein) coding genes was identified in 17 Asgard genomes (thirteen complete genomes and four genomes with less than 20 contigs) and compared with those found previously in non-Asgard archaeal and bacterial genomes. A universal core of two clusters of 14 and 4 cooccurring r-proteins, respectively, was identified in both the Asgard and non-Asgard Archaea. The equivalent genes in the E. coli version of the cluster are found in the S10 and spc operons. The large cluster of 14 r-protein genes (uS19-uL22-uS3-uL29-uS17 from the S10 operon and uL14-uL24-uL5-uS14-uS8-uL6-uL18-uS5-uL30-uL15 from the spc operon) occurs as a complete set in the genomes of thirteen Asgard genomes (five Lokiarchaeotes, three Heimdallarchaeotes, one Odinarchaeote, and four Thorarchaeotes). Four less conserved clusters with partial bacterial equivalents were found in the Asgard. These were the L30e (str operon in Bacteria) cluster, the L18e (alpha operon in Bacteria) cluster, the S24e-S27ae-rpoE1 cluster, and the L31e, L12..L1 cluster. Finally, a new cluster referred to as L7ae was identified. In many cases, r-protein gene clusters/operons are less conserved in their organization in the Asgard group than in other Archaea. If this is generally true for nonribosomal gene clusters, the results may have implications for the history of genome organization. In particular, there may have been an early transition to or from the operon approach to genome organization. Other nonribosomal cellular features may support different relationships. For this reason, it may be important to consider ribosome features separately.
- Research Article
21
- 10.1007/bf00355061
- Apr 1, 1991
- Current Genetics
rps10, encoding the plastid ribosomal protein S10, is a nuclear gene in higher plants and green algae, and is missing from the large ribosomal protein gene cluster of chlorophyll b-type plastids that contains components of the prokaryotic S10, spc and alpha operons. The cyanelle genome of Cyanophora paradoxa is shown to harbor rps10 as another specific feature of its organization. However, this novel plastid gene is not contiguous with the genes of the "S10" operon, but is adjacent to, and cotranscribed with, the str operon, a trait also found in archaebacteria.
- Book Chapter
2
- 10.1007/978-3-642-75145-5_13
- Jan 1, 1991
The cyanelles from the flagellated protist Cyanophora paradoxa constitute a plastid type sui generis. Cyanelles are surrounded by a murein layer and harbor a number of unusual genes in their 127 kbp genome (Wasmann et al., 1987). In this report a 12 kbp region will be described that contains a number of clustered genes encoding ribosomal proteins the complete sequence of which has been determined recently (Evrard et al., 1990a,b,c; Bryant and Stirewalt, 1990; Kraus et al., 1990; Michalowski et al., 1990; Bryant et al., 1990). Figure 1 depicts the central part of the large single copy region of cyanelle DNA from C. paradoxa 555 UTEX showing genes for 18 ribosomal proteins, one translation factor, ferredoxin I and two tRNAs. In addition two ORFs are present, the significance of which has yet to be determined. These genes appear to be organized in an Operon structure (str operon), a transcription unit comprising rpsl8 and rpl33 and a large gene cluster in which many genes from the prokaryotic S10 and spc operons are included. A comparison of the organization of these genes with those of cyanobacteria as the close relatives of cyanelles is possible to a limited degree only. The substantial progress in cyanobacterial molecular genetics during the past several years has been focussed mostly on the characterization of photosynthetic genes. Only for the str Operon data are available.
- Research Article
49
- 10.1021/jf2004095
- Apr 15, 2011
- Journal of Agricultural and Food Chemistry
A rapid bacterial identification method by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) using ribosomal proteins coded in S10 and spc operons as biomarkers, named the S10-GERMS (the S10-spc-alpha operon gene encoded ribosomal protein mass spectrum) method, was applied for the genus Bacillus a Gram-positive bacterium. The S10-GERMS method could successfully distinguish the difference between B. subtilis subsp. subtilis NBRC 13719(T) and B. subtilis subsp. spizizenii NBRC 101239(T) because of the mass difference of 2 ribosomal subunit proteins, despite the difference of only 2 bases in the 16S rRNA gene between them. The 8 selected reliable and reproducible ribosomal subunit proteins without disturbance of S/N level on MALDI-TOF MS analysis, S10, S14, S19, L18, L22, L24, L29, and L30, coded in S10 and spc operons were significantly useful biomarkers for rapid bacterial classification at species and strain levels by the S10-GERMS method of genus Bacillus strains without purification of ribosomal proteins.
- Research Article
51
- 10.1007/bf00271555
- Nov 1, 1990
- Molecular and General Genetics MGG
In Cyanophora paradoxa photosynthetic organelles termed cyanelles perform the functions of chloroplasts in higher plants, while the structural and biochemical characteristics of the cyanelle are essentially cyanobacterial. Our interest in studying the evolutionary relationship between cyanelles and chloroplasts led us to focus on cyanelle-encoded genes of the translational apparatus, specifically genes equivalent to those of the bacterial S10 and spc operons. The structure of a large ribosomal protein gene cluster from cyanelle DNA was characterized and compared with that from plastids and bacteria. Sequences of the following cyanelle genes encompassing 4.8 kb are reported here: 5'-rpl22-rps3-rpl16-rps17-rpl14-rpl5-rps8-rpl6-rpl18- rps5-3'. Cyanelles contain five more ribosomal protein genes than do higher plant chloroplasts and four more genes than Euglena gracilis plastids in the S10/spc region of this gene cluster. The gene encoding rpl36 is absent, in contrast to the case in other plastid DNAs. These genes, including the previously characterized genes rpl3, rpl2 and rps19, are transcribed as a primary transcript of approximately 7500 nucleotides. The occurrence of transcripts smaller than this presumptive primary transcript suggests that it is processed into defined segments. Transcription terminates 3' of rps5 where a 40 bp hairpin with one mismatch (-42.2 kcal) may be folded. Immediately downstream of rps5 an open reading frame, ORF492, is contained on a separate transcript. A comparison of gene content, operon structure and deduced amino acid sequence of the genes in the S10 and spc operons from different organisms supports the notion that cyanelles are intermediary between known plastids and cyanobacteria.
- Research Article
16
- 10.1111/j.1574-6968.2000.tb08912.x
- Dec 29, 1999
- FEMS microbiology letters
A ribosomal protein gene cluster from the spirochaete Leptospira interrogans was characterized. This locus is homologous to the Escherichia coli S10, spc, and alpha operons. Analysis of L. interrogans RNA showed that the ribosomal protein genes within this cluster are co-transcribed, thus forming an operon. Two transcription initiation sites were mapped by primer extension, upstream of fus, the first gene in this cluster, and sequences from this region provided promoter activity in E. coli. Transcription terminates near a predicted stem-loop structure following rplQ, the last gene in the cluster. These data suggest that two promoters upstream of fus direct transcription of this 17.5-kb ribosomal protein gene cluster. Comparison of the L. interrogans S10-spc-alpha cluster to homologous loci from Borrelia burgdorferi and Treponema pallidum provided evidence that this region of the genome underwent several rearrangements during spirochaete evolution.
- Research Article
80
- 10.1128/jb.138.2.383-396.1979
- May 1, 1979
- Journal of Bacteriology
Using ColE1-TnA hybrid plasmid RSF2124 as the cloning vector, we constructed a hybrid plasmid, pNO1001, which carried seven ribosomal protein (r-protein) genes in the spc operon together with their promoter. The plasmid also carried three r-protein genes which precede the spc operon, but did not carry the bacterial promoter for these genes. Expression of r-protein genes carried by pNO1001 was studied by measuring messenger ribonucleic acid and r-protein synthesis in cells carrying the plasmid. It was found that the messenger ribonucleic acid for all the promoter-distal r-protein genes was synthesized in large excess relative to messenger ribonucleic acid from other chromosomal r-protein genes which are not carried by the plasmid. However, only the two promoter-proximal r-proteins, L14 and L24, were markedly overproduced. The absence of large gene dosage effects on the synthesis of other distal proteins appeared to be due, at least in part, to preferential inactivation and/or degradation of the distal message which codes for these proteins; in addition, some preferential inhibition of translation of the distal message might also have been involved. Overproduced L14 and L24 were found to be degraded in recA+ strains at both 30 and 42 degrees C; in recA strains, the degradation took place at 42 degrees C but was very slow or absent at 30 degrees C. The recA strains carrying pNO1001 failed to form colonies at 30 degrees C, presumably because of overaccumulation of r-proteins. The results suggest that degradation of excess r-proteins is an important physiological process.
- Research Article
123
- 10.1016/0092-8674(78)90096-x
- Sep 1, 1978
- Cell
DNA sequences of promoter regions for the str and spc ribosomal protein operons in E. coli
- Research Article
101
- 10.1073/pnas.83.16.6030
- Aug 1, 1986
- Proceedings of the National Academy of Sciences
The nucleotide sequence of a tobacco (Nicotiana tabacum) chloroplast gene cluster that encodes eight proteins homologous to Escherichia coli ribosomal proteins L23, L2, S19, L22, S3, L16, L14, and S8 has been determined. RNA gel blot hybridization revealed that all eight coding regions are expressed in the chloroplasts. The arrangement of the eight genes resembles that found in the E. coli S10 and spc operons. Among the eight genes, the L2 and L16 genes contain 666- and 1020-base-pair introns, respectively. These intron boundary sequences are consistent with the conserved boundary sequences of the chloroplast group III introns [Shinozaki, K., Deno, H., Sugita, M., Kuramitsu, S. & Sugiura, M. (1986) Mol. Gen. Genet. 202, 1-5].
- Book Chapter
6
- 10.1007/978-1-4615-2894-4_2
- Jan 1, 1993
Ribosomal proteins (r-proteins) bind to specific sites on ribosomal RNA (rRNA) to assemble a ribosome. Except for L12/L7, there is always a stoichiometric amount of all r-proteins and rRNA and the amount of free ribosomal proteins is low in the cell. Thus a coordinated regulation must exist. For the vast majority of ribosomal proteins, the mechanism of coordinate expression is at the translational autoregulation level. Some exceptions to this rule are observed, e.g., in the S10 operon (regulated both by transcription attenuation and translation inhibition1), the spc operon,2 and the trmD operon (nonautoregulated).3,4 In most cases, each operon encoding a ribosomal protein encodes a ribosomal protein repressor (Fig. 1). This repressor binds generally specifically to the 5′ region of the messenger and stops the translation of all downstream cistrons. This phenomenon, called translational coupling, has been demonstrated for the Lll-L1 operon.
- Research Article
4
- 10.1016/s0378-1119(97)00072-3
- Jul 1, 1997
- Gene
Sequencing and analysis of the Thermus thermophilus ribosomal protein gene cluster equivalent to the spectinomycin operon
- Research Article
17
- 10.1128/jb.176.24.7703-7710.1994
- Dec 1, 1994
- Journal of Bacteriology
A segment of Thermotoga maritima DNA spanning 6,613 bp downstream from the gene tuf for elongation factor Tu was sequenced by use of a chromosome walking strategy. The sequenced region comprised a string of 14 tightly linked open reading frames (ORFs) starting 50 bp downstream from tuf. The first 11 ORFs were identified as homologs of ribosomal protein genes rps10, rpl3, rpl4, rpl23, rpl2, rps19, rpl22, rps3, rpl16, rpl29, and rps17 (which in Escherichia coli constitute the S10 operon, in that order); the last three ORFs were homologous to genes rpl14, rpl24, and rpl5 (which in E. coli constitute the three promoter-proximal genes of the spectinomycin operon). The 14-gene string was preceded by putative -35 and -10 promoter sequences situated 5' to gene rps10, within the 50-bp spacing between genes tuf and rps10; the same region exhibited a potential transcription termination signal for the upstream gene cluster (having tuf as the last gene) but displayed also the potential for formation of a hairpin loop hindering the terminator; this suggests that transcription of rps10 and downstream genes may start farther upstream. The similar organization of the sequenced rp genes in the deepest-branching bacterial phyla (T. maritima) and among Archaea has been interpreted as indicating that the S10-spc gene arrangement existed in the (last) common ancestor. The phylogenetic depth of the Thermotoga lineage was probed by use of r proteins as marker molecules: in all except one case (S3), Proteobacteria or the gram-positive bacteria, and not the genus Thermotoga, were the deepest-branching lineage; in only two cases, however, was the inferred branching order substantiated by bootstrap analysis.
- Research Article
143
- 10.1073/pnas.76.7.3411
- Jul 1, 1979
- Proceedings of the National Academy of Sciences
In an Escherichia coli strain lysogenic for lambda spc2 transducing phage, an extra copy of ribosomal protein (r-protein) genes in the spc and alpha operons are carried on the phage chromosome. Expression of genes in the spc operon in this merodiploid strain was compared with that in a control "haploid" strain carrying lambda trkA phage. It was found that the synthesis rate of spc mRNA, relative to other reference mRNA in the merodiploid strain, is about 2-fold higher than that in the control strain; yet, no dosage effect was observed in the synthesis rate of r-proteins in the spc or alpha operon. The spc mRNA was found to be more rapidly degraded in the merodiploid strain than in the control strain, and its steady-state amount, relative to reference mRNA, was only slightly higher in the merodiploid strain than in the control strain. Thus, E. coli cells have the ability to regulate the rate of r-protein synthesis regardless of the rate of transcription of r-protein genes, presumably by inactivation of the mRNA followed by its degradation. A model is proposed which involves selective inactivation of r-protein mRNA by a feedback mechanism. The model can explain coordinated synthesis of r-proteins and other observations related to selective expression of certain alleles in diploid strains.
- Research Article
4
- 10.1186/1471-2105-14-118
- Apr 4, 2013
- BMC Bioinformatics
BackgroundPyrrolysine (the 22nd amino acid) is in certain organisms and under certain circumstances encoded by the amber stop codon, UAG. The circumstances driving pyrrolysine translation are not well understood. The involvement of a predicted mRNA structure in the region downstream UAG has been suggested, but the structure does not seem to be present in all pyrrolysine incorporating genes.ResultsWe propose a strategy to predict pyrrolysine encoding genes in genomes of archaea and bacteria. We cluster open reading frames interrupted by the amber codon based on sequence similarity. We rank these clusters according to several features that may influence pyrrolysine translation. The ranking effects of different features are assessed and we propose a weighted combination of these features which best explains the currently known pyrrolysine incorporating genes. We devote special attention to the effect of structural conservation and provide further substantiation to support that structural conservation may be influential – but is not a necessary factor. Finally, from the weighted ranking, we identify a number of potentially pyrrolysine incorporating genes.ConclusionsWe propose a method for prediction of pyrrolysine incorporating genes in genomes of bacteria and archaea leading to insights about the factors driving pyrrolysine translation and identification of new gene candidates. The method predicts known conserved genes with high recall and predicts several other promising candidates for experimental verification. The method is implemented as a computational pipeline which is available on request.
- Research Article
25
- 10.1128/jb.00058-21
- May 7, 2021
- Journal of Bacteriology
Ribosomal proteins (RPs) are highly conserved across the bacterial and archaeal domains. Although many RPs are essential for survival, genome analysis demonstrates the absence of some RP genes in many bacterial and archaeal genomes. Furthermore, global transposon mutagenesis and/or targeted deletion showed that elimination of some RP genes had only a moderate effect on the bacterial growth rate. Here, we systematically analyze the evolutionary conservation of RPs in prokaryotes by compiling the list of the ribosomal genes that are missing from one or more genomes in the recently updated version of the Clusters of Orthologous Genes (COG) database. Some of these absences occurred because the respective genes carried frameshifts, presumably, resulting from sequencing errors, while others were overlooked and not translated during genome annotation. Apart from these annotation errors, we identified multiple genuine losses of RP genes in a variety of bacteria and archaea. Some of these losses are clade-specific, whereas others occur in symbionts and parasites with dramatically reduced genomes. The lists of computationally and experimentally defined non-essential ribosomal genes show a substantial overlap, revealing a common trend in prokaryote ribosome evolution that could be linked to the architecture and assembly of the ribosomes. Thus, RPs that are located at the surface of the ribosome and/or are incorporated at a late stage of ribosome assembly are more likely to be non-essential and to be lost during microbial evolution, particularly, in the course of genome compaction.IMPORTANCEIn many prokaryote genomes, one or more ribosomal protein (RP) genes are missing. Analysis of 1,309 prokaryote genomes included in the COG database shows that only about half of the RPs are universally conserved in bacteria and archaea. In contrast, up to 16 other RPs are missing in some genomes, primarily, tiny (<1 Mb) genomes of host-associated bacteria and archaea. Ten universal and nine archaea-specific ribosomal proteins show clear patterns of lineage-specific gene loss. Most of the RPs that are frequently lost from bacterial genomes are located on the ribosome periphery and are non-essential in Escherichia coli and Bacillus subtilis These results reveal general trends and common constraints in the architecture and evolution of ribosomes in prokaryotes.
- Research Article
675
- 10.1093/nar/gkn668
- Oct 23, 2008
- Nucleic Acids Research
The first bacterial genome was sequenced in 1995, and the first archaeal genome in 1996. Soon after these breakthroughs, an exponential rate of genome sequencing was established, with a doubling time of approximately 20 months for bacteria and approximately 34 months for archaea. Comparative analysis of the hundreds of sequenced bacterial and dozens of archaeal genomes leads to several generalizations on the principles of genome organization and evolution. A crucial finding that enables functional characterization of the sequenced genomes and evolutionary reconstruction is that the majority of archaeal and bacterial genes have conserved orthologs in other, often, distant organisms. However, comparative genomics also shows that horizontal gene transfer (HGT) is a dominant force of prokaryotic evolution, along with the loss of genetic material resulting in genome contraction. A crucial component of the prokaryotic world is the mobilome, the enormous collection of viruses, plasmids and other selfish elements, which are in constant exchange with more stable chromosomes and serve as HGT vehicles. Thus, the prokaryotic genome space is a tightly connected, although compartmentalized, network, a novel notion that undermines the ‘Tree of Life’ model of evolution and requires a new conceptual framework and tools for the study of prokaryotic evolution.
- Ask R Discovery
- Chat PDF