Generation and analysis of a 29,745 unique Expressed Sequence Tags from the Pacific oyster (Crassostrea gigas) assembled into a publicly accessible database: the GigasDatabase

Elodie Fleury, Dario Moraga, Viviane Boulo, Julien De Lorgeril, Evelyne Bachère, Yannick Gueguen, Patrick Wincker, Jeanne Moal, Arnaud Tanguy, Richard Reinhardt, Christophe Lelong, Pascal Favrel, Charlotte Corporeau, François Moreews, Arnaud Huvet, Patrick Prunet, Pierre Boudry, Michel Mathieu, Grace Davey, Sylvie Lapègue, Christophe Klopp, Christopher Sauvage, Caroline Fabioux, J Shaw, Penelope K Lindeque, Frédérick Gavory

doi:10.1186/1471-2164-10-341

Abstract

Background: Although bivalves are among the most-studied marine organisms because of their ecological role and economic importance, very little information is available on the genome sequences of oyster species. This report documents three large-scale cDNA sequencing projects for the Pacific oyster Crassostrea gigas, initiated to provide a large number of expressed sequence tags (ESTs) that were subsequently compiled in a publicly accessible database. This resource allowed the identification of a large number of transcripts and provides valuable information for ongoing investigations of tissue-specific and stimulus-dependent gene expression patterns. These data are crucial for constructing comprehensive DNA microarrays, for identifying single nucleotide polymorphisms (SNPs) and microsatellites in coding regions, and for identifying genes when the entire genome sequence of C. gigas becomes available.

Description: In the present paper, we report the production of 40,845 high-quality ESTs that identify 29,745 unique transcribed sequences consisting of 7,940 contigs and 21,805 singletons. All of these new sequences, together with existing public sequence data, have been compiled into a publicly available website (http://public-contigbrowser.sigenae.org:9090/Crassostrea_gigas/index.html). Approximately 43% of the unique ESTs had significant matches against the SwissProt database and 27% were annotated using Gene Ontology terms. In addition, we identified a total of 208 in silico microsatellites from the ESTs, 173 of which had sufficient flanking sequence for primer design. We also identified a total of 7,530 putative in silico SNPs using existing and newly generated EST resources for the Pacific oyster.

Conclusion: A publicly available database has been populated with 29,745 unique sequences for the Pacific oyster Crassostrea gigas. The database provides many tools to search cleaned and assembled ESTs. The user may input and submit several filters, such as protein or nucleotide hits, to select and download relevant elements. This database constitutes one of the most developed genomic resources accessible among Lophotrochozoans, an orphan clade of bilateral animals. These data will accelerate the development of both genomics and genetics in a commercially important species with the highest annual commercial production of any aquatic organism.
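As an illustration of the in silico microsatellite screen mentioned in the Description, the following Python fragment is a minimal sketch of how simple sequence repeats could be located in an EST and checked for flanking sequence. The repeat thresholds, the 50-bp flank requirement, and the function names `find_microsatellites` and `has_flanks` are assumptions made for this example; the abstract does not state which tool or parameters were actually used.

```python
import re

# Hypothetical thresholds (not from the paper): minimum number of
# tandem repeats required for di-, tri-, and tetranucleotide motifs.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}
# Hypothetical flank length (bp) needed on each side for primer design.
MIN_FLANK = 50


def is_primitive(motif):
    """Return False if the motif is itself a repeat of a shorter unit."""
    n = len(motif)
    return not any(
        motif == motif[:k] * (n // k) for k in range(1, n) if n % k == 0
    )


def find_microsatellites(seq, min_repeats=MIN_REPEATS):
    """Return (start, end, motif, repeat_count) for each perfect repeat."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif followed by at least (min_rep - 1) exact copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if not is_primitive(motif):
                continue  # skip homopolymers and e.g. ATAT counted as a 4-mer
            count = (m.end() - m.start()) // motif_len
            hits.append((m.start(), m.end(), motif, count))
    return hits


def has_flanks(hit, seq_len, min_flank=MIN_FLANK):
    """True if enough sequence remains on both sides of the repeat."""
    start, end = hit[0], hit[1]
    return start >= min_flank and seq_len - end >= min_flank


# Usage on a synthetic sequence: an (AT)8 repeat between two 50-bp flanks.
flank = "ACGGTCATGC" * 5
est = flank + "AT" * 8 + flank
for hit in find_microsatellites(est):
    print(hit, "primer-ready" if has_flanks(hit, len(est)) else "flank too short")
```

A regular expression with a back-reference only finds perfect repeats; dedicated microsatellite finders typically also handle imperfect and compound repeats, so this sketch would undercount relative to such tools.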

Highlights

Although bivalves are among the most-studied marine organisms because of their ecological role and economic importance, very little information is available on the genome sequences of oyster species.
Several factors motivate further development of genomic resources for C. gigas: (I) Because this species has the highest annual production of any aquatic organism, C. gigas has been the subject of a great deal of research to elucidate the molecular basis underlying the physiological and genetic mechanisms of economically relevant traits. (II) The Pacific oyster's phylogenetic position in the Lophotrochozoa, an understudied clade of bilaterian animals, makes molecular data on C. gigas highly relevant for studies of genome evolution. (III) Oysters play an important role as sentinels in estuarine and coastal marine habitats where increasing human activities exacerbate the impacts of disease and stress in exploited populations. (IV) C. gigas can be an invasive species when introduced into new habitats [8].
The genomic strategies currently employed for the identification of novel and previously characterized genes affecting phenotypes of interest in the Pacific oyster include the identification of quantitative trait loci (QTL) and high-throughput studies of gene expression [21].

Summary

Conclusion

We report the production and the sequencing of clones from 9 cDNA libraries derived from different C. gigas tissues, and from oysters sampled under different conditions, obtaining 40,845 high-quality ESTs that identify 29,745 unique transcribed sequences. Putative annotation was assigned to 43% of the sequences showing similarity to known genes, mostly from other species, in one or more of the databases used for automatic annotation. All data on ESTs, clustering, and annotation can be accessed from the dedicated database, GigasDatabase, available at http://public-contigbrowser.sigenae.org:9090/Crassostrea_gigas/index.html.

This table lists 12,790 non-redundant sequences identifying known C. gigas sequences showing significant similarity (E-value < 10⁻⁶) with predicted proteins from mollusks and other organisms. This table includes the GenBank Accession numbers of the ESTs and corresponding best SwissProt hit descriptions.

[Table: number of contigs and putative SNP sites for contigs with > 50, 11–50, 6–10, 5, 4, 3, and 2 sequences.]
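The significance threshold quoted above (E-value < 10⁻⁶) can be applied to similarity-search output with a few lines of Python. The sketch below assumes the hits are in the standard 12-column BLAST tabular layout (query ID in column 1, subject ID in column 2, E-value in column 11); the source does not say which output format the annotation pipeline used, and the function name `best_hits` and the sample identifiers are hypothetical.

```python
import csv
import io

# Cutoff taken from the text: a hit is significant when E-value < 1e-6.
EVALUE_CUTOFF = 1e-6


def best_hits(blast_tabular, cutoff=EVALUE_CUTOFF):
    """Map each query to its (subject, evalue) hit with the lowest E-value.

    Assumes BLAST 12-column tabular rows; queries with no hit below the
    cutoff are left out of the result.
    """
    best = {}
    for row in csv.reader(blast_tabular, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        query, subject, evalue = row[0], row[1], float(row[10])
        if evalue >= cutoff:
            continue
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return best


# Usage with made-up rows: est1 has two significant hits, est2 has none.
sample = io.StringIO(
    "est1\tsp|P1\t80.0\t100\t20\t0\t1\t100\t1\t100\t1e-20\t150\n"
    "est1\tsp|P2\t70.0\t100\t30\t0\t1\t100\t1\t100\t1e-08\t90\n"
    "est2\tsp|P3\t40.0\t50\t30\t0\t1\t50\t1\t50\t0.5\t30\n"
)
print(best_hits(sample))
```

Dividing the number of queries returned by the total number of unique sequences would give an annotation rate comparable in kind to the 43% reported here.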

Journal: BMC Genomics	Publication Date: Jul 29, 2009
Citations: 168	License type: cc-by

Similar Papers

Comparison of microsatellites and SNPs for pedigree analysis in the Pacific oyster Crassostrea gigas
Ting Liu ... Qi Li
Aquaculture International | VOL. 25
09 Mar 2017

Introduction and evaluation on the US West Coast of a new strain (Midori) of Pacific oyster (Crassostrea gigas) collected from the Ariake Sea, southern Japan
Claudio Manoel Rodrigues De Melo ... Chris Langdon
Aquaculture | VOL. 531
28 Sep 2020

BHLH genes polymorphisms and their association with growth traits in the Pacific oyster Crassostrea gigas
Na Chen ... Chenghua Li
Journal of Oceanology and Limnology | VOL. 38
05 Dec 2019

Association and Functional Analyses Revealed That PPP1R3B Plays an Important Role in the Regulation of Glycogen Content in the Pacific Oyster Crassostrea gigas
Sheng Liu ... Baoyu Huang
Frontiers in Genetics | VOL. 10
14 Feb 2019
