Abstract

BackgroundDanshen (Salvia miltiorrhiza Bunge), also known as Chinese red sage, is a member of Lamiaceae family. It is valued in traditional Chinese medicine, primarily for the treatment of cardiovascular and cerebrovascular diseases. Because of its pharmacological potential, ongoing research aims to identify novel bioactive compounds in danshen, and their biosynthetic pathways. To date, only expressed sequence tag (EST) and RNA-seq data for this herbal plant are available to the public. We therefore propose that the construction of a reference genome for danshen will help elucidate the biosynthetic pathways of important secondary metabolites, thereby advancing the investigation of novel drugs from this plant.FindingsWe assembled the highly heterozygous danshen genome with the help of 395 × raw read coverage using Illumina technologies and about 10 × raw read coverage by using single molecular sequencing technology. The final draft genome is approximately 641 Mb, with a contig N50 size of 82.8 kb and a scaffold N50 size of 1.2 Mb. Further analyses predicted 34,598 protein-coding genes and 1,644 unique gene families in the danshen genome.ConclusionsThe draft danshen genome will provide a valuable resource for the investigation of novel bioactive compounds in this Chinese herb.Electronic supplementary materialThe online version of this article (doi:10.1186/s13742-015-0104-3) contains supplementary material, which is available to authorized users.

Highlights

  • Danshen (Salvia miltiorrhiza Bunge), known as Chinese red sage, is a member of Lamiaceae family

  • The draft danshen genome will provide a valuable resource for the investigation of novel bioactive compounds in this Chinese herb

  • Danshen genomic DNA sequencing on Illumina platforms Genomic DNA was extracted from the leaf tissues of a single danshen plant using the cetyltrimethylammonium bromide (CTAB) method

Read more

Summary

Introduction

Danshen (Salvia miltiorrhiza Bunge), known as Chinese red sage, is a member of Lamiaceae family. Sequencing statistics for all libraries are outlined in Additional file 1: Table S1. With the removal of low quality and duplicated reads, ~147 Gb of clean data were obtained for the de novo assembly of the danshen genome.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call