Abstract

Sequence patterns surrounding the translation initiation sites of a cyanobacterium were analyzed in detail with a hidden Markov model (HMM) built from experimentally determined translation initiation sites. In a previous study, Sazuka et al. determined 72 actual protein-coding regions and their translation initiation sites on the genome of Synechocystis sp. strain PCC6803 using two-dimensional protein electrophoresis and microsequencing. In this work, we extracted the sequence patterns surrounding these translation initiation sites as an HMM using the computer program YEBIS. The constructed HMM recognized all but one of the translation initiation sites. The HMM contains an AG-rich region (5.7 bp on average) upstream of the translation initiation site (at position -9.7 on average), in keeping with the Shine-Dalgarno sequence, which consists exclusively of purines, and a CT-rich region (4.2 bp on average) just upstream of the translation initiation site. In addition, we found that the codon for the second amino acid (positions +4, 5, 6) could be classified into two types: one has C as its second nucleotide, while the other has a nucleotide distribution relatively similar to the distribution among amino acids in the 72 proteins. This corresponds well to our earlier finding that when the second nucleotide of the codon for the second amino acid of a translated protein was C, the initial methionine was processed, and that otherwise the methionine was left intact with high frequency.
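
The second-codon observation can be illustrated with a minimal sketch. It is not the HMM and not part of YEBIS. The function names are hypothetical, and the code assumes the input is a coding sequence that begins at the first base of the start codon (position +1). It encodes only the tendency described above, which holds "with high frequency" rather than always.

```python
def second_codon(cds: str) -> str:
    """Return the second codon, i.e. nucleotides +4..+6 of the coding sequence.

    Assumes `cds` starts at the first base of the start codon (+1).
    RNA input is accepted and converted to the DNA alphabet.
    """
    seq = cds.upper().replace("U", "T")
    if len(seq) < 6:
        raise ValueError("need at least six nucleotides (start codon plus second codon)")
    return seq[3:6]


def predict_initial_met(cds: str) -> str:
    """Heuristic label based on the middle nucleotide of the second codon.

    'processed' if that nucleotide is C, otherwise 'intact'.
    This is an illustrative rule of thumb, not a validated predictor.
    """
    return "processed" if second_codon(cds)[1] == "C" else "intact"
```

For example, under these assumptions `predict_initial_met("ATGGCTAAA")` looks at the second codon `GCT`, whose middle nucleotide is C, and labels the initial methionine as likely processed.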
