Sago plant (Metroxylon sagu Rottb.) is one of the most carbohydrate-producing plants in the world. Microsatellites or simple sequence repeats (SSRs) play an important role in the genome and are used extensively compared to other molecular markers. For the first time, we are exploiting data expressed sequence tags (EST) of sago plants to identify and characterise markers in this species. EST data about sago plants are obtained through the EST database on the National Center for Biotechnology Information (NCBI) website. We obtained data of 458 Kb (412 contig) with a maximum and minimum length of 1,138 and 124 nucleotides, respectively. We successfully identified 820 perfectly patterned SSR using Phobos 3.3.12 software. The type characterisation of EST-SSR was dominated by tri-nucleotides 36% (294), followed by hexa-nucleotides 24% (202), tetra-nucleotides 15% (120), penta-nucleotides 13% (108) and di-nucleotides 12% (96). The most frequency of SSR motifs in each type is AG, AAG and AAAG. Analysis of synteny on the EST sequence with the online application Phytozome found that sequences were distributed on 12 Oryza sativa chromosomes with a likeness percentage between 63% to 100% and e-value between 0 to 0.094. We developed the primer and generated 19 primers. Furthermore, we validated 7 primers that all generated polymorphic alleles. To our knowledge, this report is the first identification and characterisation of EST-SSR for sago species and these markers can be used for genetic diversity analysis, marker assisted selection (MAS), cultivar identification, kinship analysis and genetic mapping analysis.
Read full abstract