Abstract

The Chinese lake gudgeon Sarcocheilichthys sinensis is a small cyprinid fish with great aquaculture potential both for its edible and ornamental values. Nevertheless, available genomic and transcriptomic information for this fish is extremely deficient. In this study, a normalized cDNA library was constructed using 13 mixed tissues of an adult male S. sinensis, and was sequenced by the Illumina HiSeq2500 platform. De novo assembly was performed using 38,911,511 obtained clean reads, and a total of 147,282 unigenes with an average length of 900 bp were finally achieved. 96.2% of these unigenes were annotated in 9 public databases, and 16 segments of growth-related genes were identified for future studies. In addition, 28,493 unigenes were assigned to 61 subcategories of Gene Ontology (GO), and 10,483 unigenes were assigned to 25 categories of Cluster of Orthologous Group (COG). Moreover, 14,943 unigenes were classified into 225 pathways of the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. A total of 30,666 microsatellites were detected from 17,627 unigenes with an average distribution density of 1:2405 bp. This transcriptome data set will be valuable for researches on discovery, expression and evolution on genes of interest. Meanwhile, the identified microsatellites would be useful tools for genetic and genomic studies in S. sinensis.

Highlights

  • With the development of high-throughput sequencing technologies, genetic and genomic information is much easier to obtain for non-model organisms than ever before

  • Using the high-throughput Illumina HiSeq 2500 platform, the transcriptome of 13 mixed tissues was sequenced in S. sinensis

  • The data obtained in this study could help us to understand the transcriptome of adult S. sinensis on overall, and information of unigenes will be valuable for further researches on genes of interest, such as those related to growth, immunity and physiological adaptation

Read more

Summary

Introduction

With the development of high-throughput sequencing technologies, genetic and genomic information is much easier to obtain for non-model organisms than ever before. Transcriptome analysis is an efficient method for genome survey, massive functional gene identification and molecular marker isolation [1]. Transcriptomic sequences are all coding DNA and have a much higher rate of functional information; they are more helpful in revealing molecular mechanisms of functional genes [2,3]. RNA sequencing (RNA-Seq), such as Illumina, Roche 454 and Solexa, is a powerful high-throughput sequencing approach, and has been applied to acquire massive of transcriptomic information. Characterization and annotation for the transcriptome of Sarcocheilichthys sinensis role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.