Abstract

DNA barcoding is an efficient method to identify specimens and to detect undescribed/cryptic species. Sanger sequencing of individual specimens is the standard approach in generating large-scale DNA barcode libraries and identifying unknowns. However, the Sanger sequencing technology is, in some respects, inferior to next-generation sequencers, which are capable of producing millions of sequence reads simultaneously. Additionally, direct Sanger sequencing of DNA barcode amplicons, as practiced in most DNA barcoding procedures, is hampered by the need for relatively high-target amplicon yield, coamplification of nuclear mitochondrial pseudogenes, confusion with sequences from intracellular endosymbiotic bacteria (e.g. Wolbachia) and instances of intraindividual variability (i.e. heteroplasmy). Any of these situations can lead to failed Sanger sequencing attempts or ambiguity of the generated DNA barcodes. Here, we demonstrate the potential application of next-generation sequencing platforms for parallel acquisition of DNA barcode sequences from hundreds of specimens simultaneously. To facilitate retrieval of sequences obtained from individual specimens, we tag individual specimens during PCR amplification using unique 10-mer oligonucleotides attached to DNA barcoding PCR primers. We employ 454 pyrosequencing to recover full-length DNA barcodes of 190 specimens using 12.5% capacity of a 454 sequencing run (i.e. two lanes of a 16 lane run). We obtained an average of 143 sequence reads for each individual specimen. The sequences produced are full-length DNA barcodes for all but one of the included specimens. In a subset of samples, we also detected Wolbachia, nontarget species, and heteroplasmic sequences. Next-generation sequencing is of great value because of its protocol simplicity, greatly reduced cost per barcode read, faster throughout and added information content.

Highlights

  • The use of short, standardized DNA sequences, or DNA barcodes, for the purposes of individual identification of organisms has contributed to different areas of biological research (Savolainen et al 2005; Hajibabaei et al 2007)

  • The conventional means of generating DNA sequence data to obtain a barcode for a species or a specimen are through PCR amplification and Sanger sequencing (Sanger et al 1977) of DNA barcode sequences from genomic DNA extracted from individual specimens

  • For 178 of the 190 specimens (93.7%), identifiable DNA sequences were recovered by the Sanger method

Read more

Summary

Introduction

The use of short, standardized DNA sequences, or DNA barcodes, for the purposes of individual identification of organisms has contributed to different areas of biological research (Savolainen et al 2005; Hajibabaei et al 2007). The conventional means of generating DNA sequence data to obtain a barcode for a species or a specimen are through PCR amplification and Sanger sequencing (Sanger et al 1977) of DNA barcode sequences from genomic DNA extracted from individual specimens. For library construction, this is for well-identified specimens, while for species detection, the specimen need not be described. The fungal DNA barcode, ITS, has been shown to be present as multiple variable copies within an individual’s genome (Schoch et al 2012), leading to difficulty in direct Sanger sequencing of amplicons. The genetic information contained within these alternate DNA sequences is discarded to generate a single barcode sequence

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call