Abstract

Unequal usage of synonymous codons is known as codon usage bias (CUB), which is generally different between the high-expression genes (HEG) and low-expression genes (LEG) in organisms is not yet adequately reported across different bacteria. In this study, a machine learning-based approach was implemented initially to find out codons that are significantly different between the HEG and LEG in Escherichia coli. It identified Cys codons such as UGU and UGC, Lys codons such as AAA and AAG that were least influenced by gene expression. Codons such as UCU (Ser), CUG (Leu), GGG (Gly), CGG (Arg) etc. were identified to be influenced maximum by the gene expression. The study was extended to analyze codon usage in 683 other bacterial species. Cys (UGU/UGC) and Ser (AGU/AGC) codons were identified being the least different between the two groups of genes across these bacterial species. Codons such as CGA, CUG, GGG, GCC, ACC, AUA, and AUC were identified to be influenced by the gene expression across majority of these species. This study supports the role of CUB on gene expression across bacteria and demonstrates a commonality among bacteria regarding behavior of certain codons with regard to gene expression.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.