Abstract

DNA is widely used to construct heterologously expressed genes. The adaptation of the codons to the host organism is necessary in order to ensure sufficient production of proteins. The GC content, codon identity and the mRNA from the translation site are also important in the design of the gene construct. This study performed a strategy for the design of synthetic gene encoding HPV52 L1 protein and several analyses at the genetic level to optimize its protein expression in the Escherichia coli BL21(DE3) host. The determination of the codon optimization was performed by collecting 75 HPV52 L1 protein sequences in the NCBI database. Furthermore, all the sequences were analyzed using multiple global alignments by Clustal Omega web server. Once the model was determined, codon optimization was performed using OPTIMIZER and the web server of the IDT codon optimization tool based on the E. Coli B. The generated open reading frame (ORF) sequence was analyzed using Restriction mapper web server to choose the restriction site for facilitating the cloning stage, which is adjusted for pJExpress414 expression vector. To maximize the protein expression level, the mRNA secondary structure analysis around the ribosome binding site (rbs) was performed. A slight modification at the 5’-terminal end waa carried out in order to get more accessible rbs and increasing mRNA folding free energy. Finally, the construction of the synthetic gene was confirmed to ensure that no mutation occurs in the protein and to calculate its Codon Adaptation Index (CAI) and GC content. The above strategy, which leads to a good ORF sequence with the value of the free mRNA folding energy around rbs, is -5.5 kcal / mol, CAI = 0.787 and GC content 49.5%. This result is much better than its original gene. This result is much better compared to its native gene. Theoretically it is possible that this synthetic gene construct generates a high level protein expression in E. coli BL21 (DE3) under the regulation of the T7 promoter.

Highlights

  • Nowadays, synthetic DNA is widely used to construct the gene for heterologous protein expression

  • The HPV52 L1 protein sequence having 100% identity discovered from this analysis was selected for a template in codon optimization step

  • The HPV52 L1 protein sequence was selected for codon optimization by using web server (Puigbò et al, 2007)

Read more

Summary

INTRODUCTION

Synthetic DNA is widely used to construct the gene for heterologous protein expression. The modification of the native DNA sequences is mostly introduced into the synthetic gene to maximize the expression, especially in the heterologous host (Quan et al, 2011; Gaspar et al, 2012; Luo et al, 2016). Several factors should be carefully considered when designing the construct, including codon usage, codon identity, GC-content, mRNA folding free energy, especially around the ribosome binding site (RBS) These play a role in the production of high-level recombinant proteins in Escherichia coli There are many online software packages available for performing codon optimization for designing the synthetic gene. To the best of our knowledge, this study is the only one that discusses the codon optimization strategy and the construction of the synthetic gene encoding L1 major capsid protein of HPV52 for E. coli expression system

MATERIALS AND METHODS
RESULTS AND DISCUSSION
CONCLUSION
Methods
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.