Abstract

The availability of large expressed sequence tag (EST) and whole genome databases of oil palm enabled the development of a database of microsatellite markers. For this purpose, an EST database of 40,979 sequences spanning 27 Mb and a chromosome-wise whole genome database were downloaded. A total of 3,950 primer pairs were developed from the EST sequences. Tri- and tetra-nucleotide repeat motifs were the most prevalent (24.75% each), followed by di-nucleotide repeat motifs. Whole genome-wide analysis found a total of 245,654 SSRs across the 16 chromosomes of oil palm, of which 38,717 were compound microsatellite repeats. A web application, OpSatdb, the first microsatellite database of oil palm, was developed using PHP and MySQL (https://ssr.icar.gov.in/index.php). It provides a simple and systematic web-based search engine for finding SSRs by repeat motif type, repeat type, and primer details. High synteny was observed between the oil palm and rice genomes. Mapping the SSR-containing ESTs with Blast2GO assigned gene ontology (GO) annotations to 19.2% of the sequences. A randomly chosen set of ten genic SSRs and five genomic SSRs was used for validation and genetic diversity analysis on 100 genotypes belonging to the world oil palm genetic resources. The grouping pattern was broadly in accordance with the geographical origin of the genotypes. The identified genic and genome-wide SSRs can be useful for various genomic applications in oil palm, such as genetic diversity analysis, linkage map construction, QTL mapping, marker-assisted selection, and comparative population studies.
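
To illustrate what in silico SSR mining involves, the following is a minimal Python sketch that scans a nucleotide sequence for perfect microsatellites using a regular expression with a backreference. The abstract does not name the mining tool or the thresholds used in the study, so the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen only for illustration. Compound SSRs and primer design are not covered.

```python
import re

# Hypothetical minimum repeat counts per motif length (assumed, not from the study).
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(sequence, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = sequence.upper()
    hits = []
    for motif_len, min_count in sorted(min_repeats.items()):
        # One motif of motif_len bases followed by at least
        # (min_count - 1) further copies of the same motif.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_count - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "AA" or "ATAT", so each SSR is reported at its true motif length.
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            repeat_count = len(match.group(0)) // motif_len
            hits.append((match.start(), motif, repeat_count))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCGT" + "GAA" * 5 + "TC"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Positions are 0-based offsets into the input sequence. Grouping the reported motifs by length would give the kind of di-, tri-, and tetra-nucleotide frequency breakdown summarized in the abstract.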

Highlights

  • The availability of large expressed sequence tag (EST) and whole genome databases of oil palm enabled the development of a database of microsatellite markers

  • The objectives of the present study are (1) in silico mining of genic and whole genome-wide microsatellites of oil palm, along with their frequency and distribution analysis; (2) validation, polymorphism, and genetic diversity analysis of genic and genome-wide simple sequence repeat (SSR) markers among 100 oil palm genetic resources belonging to 18 accessions; (3) functional annotation of the EST sequences; and (4) design and development of a web-based microsatellite database of oil palm

  • The present study is the first report on the identification of EST-based SSRs (EST-SSRs) using a large number of EST sequences available in the database of oil palm


