Synonymous codon usage defines functional gene families.

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

The degeneracy of the genetic code is increasingly recognized for roles in regulating translation rate, protein folding, and cell response. However, the functional genomics of codon usage patterns remains poorly defined. We previously showed that prokaryotic and eukaryotic cells respond to individual stresses by uniquely reprogramming the tRNA pool and the dozens of tRNA modifications comprising the tRNA epitranscriptome to cause selective translation of mRNAs from codon-biased stress response genes. Here, we tested the hypothesis that functional gene families have distinct values of codon bias in the Saccharomyces cerevisiae genome by modeling isoacceptor codon distributions using a new approach-analysis of synonymous codon signatures (ASCS). Application of ASCS to the S. cerevisiae genome revealed linear relationships between patterns of codon bias and gene function using canonical correlation analysis. By mapping codon-biased open reading frames (ORFs) onto a functional network of gene ontology (GO) categories, we identified 91 gene families distinguished by unique codon usage signatures. The codon usage patterns were found to strongly predict functional clusters of genes, such as translational machinery, transcription, and metabolic processes. The ASCS-derived model of codon usage patterns in S. cerevisiae reveals functional codon bias signatures and captures more biologically meaningful information when compared to other codon analytical approaches.

Similar Papers
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 43
  • 10.1074/jbc.m113.467977
Genetic Code-guided Protein Synthesis and Folding in Escherichia coli
  • Oct 1, 2013
  • Journal of Biological Chemistry
  • Shaoliang Hu + 3 more

Universal genetic codes are degenerated with 61 codons specifying 20 amino acids, thus creating synonymous codons for a single amino acid. Synonymous codons have been shown to affect protein properties in a given organism. To address this issue and explore how Escherichia coli selects its "codon-preferred" DNA template(s) for synthesis of proteins with required properties, we have designed synonymous codon libraries based on an antibody (scFv) sequence and carried out bacterial expression and screening for variants with altered properties. As a result, 342 codon variants have been identified, differing significantly in protein solubility and functionality while retaining the identical original amino acid sequence. The soluble expression level varied from completely insoluble aggregates to a soluble yield of ~2.5 mg/liter, whereas the antigen-binding activity changed from no binding at all to a binding affinity of > 10(-8) m. Not only does our work demonstrate the involvement of genetic codes in regulating protein synthesis and folding but it also provides a novel screening strategy for producing improved proteins without the need to substitute amino acids.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 178
  • 10.1371/journal.pone.0090905
Genome-Wide Analysis of Codon Usage and Influencing Factors in Chikungunya Viruses
  • Mar 4, 2014
  • PLoS ONE
  • Azeem Mehmood Butt + 2 more

Chikungunya virus (CHIKV) is an arthropod-borne virus of the family Togaviridae that is transmitted to humans by Aedes spp. mosquitoes. Its genome comprises a 12 kb single-strand positive-sense RNA. In the present study, we report the patterns of synonymous codon usage in 141 CHIKV genomes by calculating several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis showed that the preferred synonymous codons were G/C and A-ended. A comparative analysis of RSCU between CHIKV and its hosts showed that codon usage patterns of CHIKV are a mixture of coincidence and antagonism. Similarity index analysis showed that the overall codon usage patterns of CHIKV have been strongly influenced by Pan troglodytes and Aedes albopictus during evolution. The overall codon usage bias was low in CHIKV genomes, as inferred from the analysis of effective number of codons (ENC) and codon adaptation index (CAI). Our data suggested that although mutation pressure dominates codon usage in CHIKV, patterns of codon usage in CHIKV are also under the influence of natural selection from its hosts and geography. To the best of our knowledge, this is first report describing codon usage analysis in CHIKV genomes. The findings from this study are expected to increase our understanding of factors involved in viral evolution, and fitness towards hosts and the environment.

  • Research Article
  • Cite Count Icon 1
  • 10.5897/ajmr11.697
Synonymous codon usage bias of spike genes of porcine epidemic diarrhea virus
  • Oct 16, 2011
  • African Journal of Microbiology Research
  • Cao H W

Synonymous codons are not used randomly. Rather, some codons are used more frequently than others. Investigating codon usage bias is essential to the understanding of viral evolution. However the factors shaping synonymous codon usage bias and nucleotide composition in animal viruses have been studied only to a limited extent. In our study, using the 19 complete CDS sequences of Spike (S) genes of porcine epidemic diarrhea virus (PEDV), we analyzed synonymous codon usage bias. The relative synonymous codon usage (RSCU) was used to estimate codon usage variation in each gene, and the results indicated that preferentially used codons were A-ended, G-ended, and U-ended codons. Effective number of codons (ENC) values varies from 48.15 to 49.52, which suggested that the codon usage bias in PEDV S genes was very slightly. Correspondence analysis (COA) was further performed to study the major trend in codon usage variation, and the plot of ENC values against GC3s (at synonymous third codon position) revealed that mutational pressure rather than translational selection was the main factor determining the codon usage bias in PEDV S genes. Moreover, correlation analysis indicated that aromaticity and hydrophobicity of S genes also influenced the codon usage variation in a minor way. Our study provides the most comprehensive analysis to date of PEDV S genes’ codon usage patterns and provides a basic understanding of the mechanisms for codon usage bias. Key words: Porcine epidemic diarrhea virus, Spike gene, relative synonymous codon usage, effective number of codons, correspondence analysis, correlation analysis.

  • Research Article
  • Cite Count Icon 184
  • 10.1093/nar/gkf546
Synonymous codon usage is subject to selection in thermophilic bacteria.
  • Oct 1, 2002
  • Nucleic Acids Research
  • D J Lynn

The patterns of synonymous codon usage, both within and among genomes, have been extensively studied over the past two decades. Despite the accumulating evidence that natural selection can shape codon usage, it has not been possible to link a particular pattern of codon usage to a specific external selective force. Here, we have analyzed the patterns of synonymous codon usage in 40 completely sequenced prokaryotic genomes. By combining the genes from several genomes (more than 80 000 genes in all) into a single dataset for this analysis, we were able to investigate variations in codon usage, both within and between genomes. The results show that synonymous codon usage is affected by two major factors: (i) the overall G+C content of the genome and (ii) growth at high temperature. This study focused on the relationship between synonymous codon usage and the ability to grow at high temperature. We have been able to eliminate both phylogenetic history and lateral gene transfer as possible explanations for the characteristic pattern of codon usage among the thermophiles. Thus, these results demonstrate a clear link between a particular pattern of codon usage and an external selective force.

  • Research Article
  • 10.1007/s40011-014-0457-x
In silico Analysis of Synonymous Codon Usage Pattern of Rhizobium etli CFN 42
  • Nov 19, 2014
  • Proceedings of the National Academy of Sciences, India Section B: Biological Sciences
  • Mohammad Samir Farooqi + 6 more

Understanding the extent and pattern of codon bias and the forces affecting codon usage are the key steps towards elucidating choice of codons at the level of individual genes. In the present study, codon usage pattern among 3,703 genes of nitrogen fixing bacterium, Rhizobium etli CFN 42 was analyzed. The study aims to identify the factors responsible for codon usage bias and highly expressed genes of this bacterium. Multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation among the genes. Correlation analysis suggested that the axis has strong significant positive correlation with GC3s i.e. GC content at the third codon position. A significant negative correlation between effective number of codons and GC3s content was observed suggesting that the codon usage was affected by gene nucleotide composition. These findings suggested mutational role as the major factor in shaping codon usage bias among the genes. Further, correspondence analysis of Relative Synonymous Codon Usage revealed cluster of highly expressed genes. Notably, 26 codons were determined as the ‘optimal codons’ that were significantly more frequent among the highly expressed genes tested by χ2 test (P < 0.01). Such results may add value to the efforts of developing bio-fertilizers based on symbiotic or non-symbiotic bacteria for improving soil fertility.

  • Research Article
  • Cite Count Icon 2
  • 10.1002/jobm.202200072
Dispersion of synonymous codon usage patterns in hepatitis E virus genomes derived from various hosts.
  • Jul 1, 2022
  • Journal of basic microbiology
  • Xin Wang + 3 more

Hepatitis E virus (HEV) is an important zoonotic pathogen infecting a wide range of host species. It has a positive-sense, single-stranded RNA genome encoding three open reading frames (ORFs). Synonymous codon usages of viruses essentially determine their survival and adaptation to susceptible hosts. To better understand the interplay between the ever-expanding host range and synonymous codon usages of HEV, we quantified the dispersion of synonymous codon usages of HEV genomes isolated from different hosts via Vs calculation and information entropy. HEV ORFs show species-specific synonymous codon usage patterns. Ruminant-derived HEV ORFs own the most synonymous codons with stable usage patterns (Vs value <0.1) which leads to the stable overall codon usage patterns (R valuebeing close to zero). Swine-derived HEV ORFs own more concentrated synonymous codons than those from wild boar. Compared with HEV strains isolated from other hosts, the human-derived HEV exhibits a distinct pattern at the overall codon usage (R < 0). Generally, ORF1 contains more synonymous codons with stable usage patterns (Vs < 0.1) than those of ORFs 2 and 3. Moreover, ORF3 contains more synonymous codons with varied patterns (Vs > 1.0) than ORFs 1 and 2. The host factor serving as one of the evolutionary dynamics probably influences synonymous codon usage patterns of theHEV genome. Taken together, synonymous codons with stable usage patterns in ORF1 might help to sustain the infection, while that with varied usage patterns in ORF3 may facilitate cross-species infection and expand the host range.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 56
  • 10.1186/1743-422x-8-146
Analysis of codon usage and nucleotide composition bias in polioviruses
  • Mar 30, 2011
  • Virology Journal
  • Jie Zhang + 8 more

BackgroundPoliovirus, the causative agent of poliomyelitis, is a human enterovirus and a member of the family of Picornaviridae and among the most rapidly evolving viruses known. Analysis of codon usage can reveal much about the molecular evolution of the viruses. However, little information about synonymous codon usage pattern of polioviruses genome has been acquired to date.MethodsThe relative synonymous codon usage (RSCU) values, effective number of codon (ENC) values, nucleotide contents and dinucleotides were investigated and a comparative analysis of codon usage pattern for open reading frames (ORFs) among 48 polioviruses isolates including 31 of genotype 1, 13 of genotype 2 and 4 of genotype 3.ResultsThe result shows that the overall extent of codon usage bias in poliovirus samples is low (mean ENC = 53.754 > 40). The general correlation between base composition and codon usage bias suggests that mutational pressure rather than natural selection is the main factor that determines the codon usage bias in those polioviruses. Depending on the RSCU data, it was found that there was a significant variation in bias of codon usage among three genotypes. Geographic factor also has some effect on the codon usage pattern (exists in the genotype-1 of polioviruses). No significant effect in gene length or vaccine derived polioviruses (DVPVs), wild viruses and live attenuated virus was observed on the variations of synonymous codon usage in the virus genes. The relative abundance of dinucleotide (CpG) in the ORFs of polioviruses are far below expected values especially in DVPVs and attenuated virus of polioviruses genotype 1.ConclusionThe information from this study may not only have theoretical value in understanding poliovirus evolution, especially for DVPVs genotype 1, but also have potential value for the development of poliovirus vaccines.

  • Research Article
  • Cite Count Icon 14
  • 10.1007/s11274-019-2693-y
Comprehensive profiling of codon usage signatures and codon context variations in the genus Ustilago.
  • Jul 22, 2019
  • World Journal of Microbiology and Biotechnology
  • Ayan Roy + 1 more

The fungal genus Ustilago consists of intimidating pathogens associated with disease manifestations in plants of agricultural importance and gravity. Rapid progress of genome sequencing has opened the floodgates for biological research. Availability of Ustilago genomes provides a scope to explore complex codon and amino acid usage patterns in the genus. An extensive scrutiny of the factors underlying the complex modalities of codon and amino acid usage in Ustilago has been executed in the present analysis. Multivariate statistical analysis revealed a dominant effect of natural selection pressure, aimed at translational accuracy, to be operative on the codon usage behavior. Subtle impact of GC compositional constraint was also evident on the codon usage patterns. Gene expressivity was inferred to be the most crucial determinant governing observed codon usage variations. Amino acid usage patterns were found to be significantly governed by aromatic and hydrophobic characters of the encoded proteins. GC content and length of protein coding sequences also had considerable influence on the amino acid usage signatures. Extensive analysis of codon context variations revealed that UpA dinucleotides were strictly avoided at the codon-codon junctions (cP3-cA1) which might be attributed to reduce the risk of nonsense mutations and subsequently, improve translational finesse. Identification of the optimal codons, employed preferentially among the genes with high expressivity, and estimation of preferred and avoided codon pairs in Ustilago promises to be useful pertaining to mutational experiments at the codonic level, targeted to thwart the growth of Ustilago and combat associated pathogenesis.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 4
  • 10.1186/s43141-022-00319-2
Decoding the codon usage patterns in Y-domain region of hepatitis E viruses
  • Apr 11, 2022
  • Journal of Genetic Engineering & Biotechnology
  • Zoya Shafat + 3 more

BackgroundHepatitis E virus (HEV) is a positive-sense RNA virus belonging to the family Hepeviridae. The genome of HEV is organized into three open-reading frames (ORFs): ORF1, ORF2, and ORF3. The ORF1 non-structural Y-domain region (YDR) has been demonstrated to play an important role in the HEV pathogenesis. The nucleotide composition, synonymous codon usage bias in conjunction with other factors influencing the viral YDR genes of HEV have not been studied. Codon usage represents a significant mechanism in establishing the host-pathogen relationship. The present study for the first time elucidates the detailed codon usage patterns of YDR among HEV and HEV-hosts (Human, Rabbit, Mongoose, Pig, Wild boar, Camel, Monkey). ResultsThe overall nucleotide composition revealed the abundance of C and U nucleotides in YDR genomes. The relative synonymous codon usage (RSCU) analysis indicated biasness towards C and U over A and G ended codons in HEV across all hosts. Codon frequency comparative analyses among HEV-hosts showed both similarities and discrepancies in usage of preferred codons encoding amino acids, which revealed that HEV codon preference neither completely differed nor completely showed similarity with its hosts. Thus, our results clearly indicated that the synonymous codon usage of HEV is a mixture of the two types of codon usage: coincidence and antagonism. Mutation pressure from virus and natural selection from host seems to be accountable for shaping the codon usage patterns in YDR. The study emphasised that the influence of compositional constraints, codon usage biasness, mutational alongside the selective forces were reflected in the occurrence of YDR codon usage patterns. ConclusionsOur study is the first in its kind to have reported the analysis of codon usage patterns on a total of seven different natural HEV hosts. Therefore, knowledge of preferred codons obtained from our study will not only augment our understanding towards molecular evolution but is also envisaged to provide insight into the efficient viral expression, viral adaptation, and host effects on the HEV YDR codon usage.

  • Research Article
  • Cite Count Icon 4
  • 10.1002/jobm.201500233
Genome-wide analysis of synonymous codon usage in Huaiyangshan virus and other bunyaviruses.
  • Jul 15, 2015
  • Journal of Basic Microbiology
  • Xuelian Luo + 5 more

Huaiyangshan virus (HYSV) is a newly discovered bunyavirus, which is transmitted by ticks and causes hemorrhagic fever-like illness in human. The interplay of codon usage among viruses and their hosts is expected to affect viral survival, evasion from host's immune system and evolution. However, little is known about the codon usage in HYSV genome. In the present study, we analyzed synonymous codon usage in 120 available full-length HYSV sequences and performed a comparative analysis of synonymous codon usage patterns in HYSV and 42 other bunyaviruses. The relative synonymous codon usage (RSCU) analysis showed that the preferred synonymous codons were G/C-ended. A comparative analysis of RSCU between HYSV and its hosts reflected that codon usage patterns of HYSV were mostly coincident with that of its hosts. Our data suggested that although mutational bias dominated codon usage, patterns of codon usage in HYSV were also under the influence of nature selection. Phylogenetic analysis based on RSCU values across different HYSV strains and 42 other bunyaviruses suggested that codon usage pattern in HYSV was the most similar with that of Uukuniemi virus among these bunyaviruses and that viruses belonged to Phlebovirus showed a diversity of codon usage patterns.

  • Research Article
  • Cite Count Icon 2
  • 10.3390/ijms19124010
Synonymous Codon Usages as an Evolutionary Dynamic for Chlamydiaceae.
  • Dec 12, 2018
  • International Journal of Molecular Sciences
  • Zhaocai Li + 5 more

The family of Chlamydiaceae contains a group of obligate intracellular bacteria that can infect a wide range of hosts. The evolutionary trend of members in this family is a hot topic, which benefits our understanding of the cross-infection of these pathogens. In this study, 14 whole genomes of 12 Chlamydia species were used to investigate the nucleotide, codon, and amino acid usage bias by synonymous codon usage value and information entropy method. The results showed that all the studied Chlamydia spp. had A/T rich genes with over-represented A or T at the third positions and G or C under-represented at these positions, suggesting that nucleotide usages influenced synonymous codon usages. The overall codon usage trend from synonymous codon usage variations divides the Chlamydia spp. into four separate clusters, while amino acid usage divides the Chlamydia spp. into two clusters with some exceptions, which reflected the genetic diversity of the Chlamydiaceae family members. The overall codon usage pattern represented by the effective number of codons (ENC) was significantly positively correlated to gene GC3 content. A negative correlation exists between ENC and the codon adaptation index for some Chlamydia species. These results suggested that mutation pressure caused by nucleotide composition constraint played an important role in shaping synonymous codon usage patterns. Furthermore, codon usage of T3ss and Pmps gene families adapted to that of the corresponding genome. Taken together, analyses help our understanding of evolutionary interactions between nucleotide, synonymous codon, and amino acid usages in genes of Chlamydiaceae family members.

  • Research Article
  • Cite Count Icon 25
  • 10.3390/cancers13112739
Insight into Codon Utilization Pattern of Tumor Suppressor Gene EPB41L3 from Different Mammalian Species Indicates Dominant Role of Selection Force
  • Jun 1, 2021
  • Cancers
  • Utsang Kumar + 11 more

Simple SummaryThe present study envisaged the codon usage pattern analysis of tumor suppressor gene EPB41L3 for the human, brown rat, domesticated cattle, and Sumatran orangutan. Most amino acids are coded by more than one synonymous codon, but they are used in a biased manner. The codon usage bias results from multiple factors like compositional properties, dinucleotide abundance, neutrality, parity, tRNA pool, etc. Understanding codon bias is central to fields as diverse as molecular evolution, gene expressivity, protein translation, and protein folding. This kind of studies is important to see the effects of various evolutionary forces on codon usage. The present study indicated that the selection force is dominant over other forces shaping codon usage in the envisaged organisms.Uneven codon usage within genes as well as among genomes is a usual phenomenon across organisms. It plays a significant role in the translational efficiency and evolution of a particular gene. EPB41L3 is a tumor suppressor protein-coding gene, and in the present study, the pattern of codon usage was envisaged. The full-length sequences of the EPB41L3 gene for the human, brown rat, domesticated cattle, and Sumatran orangutan available at the NCBI were retrieved and utilized to analyze CUB patterns across the selected mammalian species. Compositional properties, dinucleotide abundance, and parity analysis showed the dominance of A and G whilst RSCU analysis indicated the dominance of G/C-ending codons. The neutrality plot plotted between GC12 and GC3 to determine the variation between the mutation pressure and natural selection indicated the dominance of selection pressure (R = 0.926; p < 0.00001) over the three codon positions across the gene. The result is in concordance with the codon adaptation index analysis and the ENc-GC3 plot analysis, as well as the translational selection index (P2). Overall selection pressure is the dominant pressure acting during the evolution of the EPB41L3 gene.

  • Research Article
  • Cite Count Icon 67
  • 10.1016/j.cels.2021.05.019
Degradation of host translational machinery drives tRNA acquisition in viruses.
  • Aug 1, 2021
  • Cell systems
  • Joy Y Yang + 8 more

Degradation of host translational machinery drives tRNA acquisition in viruses.

  • Research Article
  • 10.1504/ijsise.2016.10000773
Analysis of codon bias pattern for genome sequences through a three-phase network model
  • Jan 1, 2016
  • International Journal of Signal and Imaging Systems Engineering
  • Ahsan Z Rizvi + 1 more

Codon bias generates specific patterns of synonymous codons usage in genomes of all texta. Codon bias patterns of genomes are extracted here in spectral domain and named as Codon Bias Spectra (CBS). Such spectral models of codon bias are demonstrated through a three-phase network model of various genomes. GC, GC3 and AT3 bias patterns along with codon spectra are shown in the paper by modifying the parameters in this spectral model. Patterns of CBS are studied here for codon bias in prokaryotic genomes. Correlation and regression analysis of CBS show the influencing effects of GC, GC3 and AT3 bias. Analysis of the area under ROC curve (AUC) is performed for testing the statistical significance of the three-phase network model for identifying codon bias locations in genome.

  • Research Article
  • Cite Count Icon 44
  • 10.1016/j.biosystems.2005.12.004
Factors influencing synonymous codon and amino acid usage biases in Mimivirus
  • Jan 24, 2006
  • Biosystems
  • K Sau + 4 more

Factors influencing synonymous codon and amino acid usage biases in Mimivirus

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.