Codon usage bias (CUB) is the phenomenon of non-uniform usage of synonymous codons in which some codons are more used than others and it helps in understanding the molecular organization of genome. Bioinformatic approach was used to analyze the protein-coding sequences of genes associated with Parkinson's disease (PD) to explore compositional features and codon usage pattern as no details work was reported yet. The average improved effective number of codons (Nc) and Nc prime were 42.74 and 44.26 respectively, indicated that CUB was low in these genes. In most of the genes, the overall GC content was almost 50% and GC content at the 1st codon position was the highest while GC content at the 2nd codon position was lowest. Relative synonymous codon usage (RSCU) analysis elucidated over-represented (p > 1.6) and under-represented codons (p < 0.6). The GTG (Val) is the only codon over-represented in all genes. Over-represented codons except (GTG) were A or T ending while under-represented codons (except ACT) were G or C ending. The codons namely TTA (Leu), CTA (Leu), ATC (Ile), ATA (Ile), AGT (Ser), AAC (Asn), TGT (Cys), TGC (Cys), CGC (Arg), AGA (Arg), and AGG (Arg) were absent in SNCA1 to SNCA8 genes. The codon TCG (Ser) was absent in all genes except UCHL1 and PINK1. Correspondence analysis (COA) revealed that the pattern of codon usage differs among genes associated with PD. Neutrality plot analysis indicated some of the points are diagonal distribution suggested that mutation pressure influenced the CUB in genes associated with PD.
Read full abstract