Abstract

DNA barcode reference libraries linked to voucher specimens create new opportunities for high-throughput identification and taxonomic re-evaluations. This study provides a DNA barcode library for about 45% of the recognized species of Canadian Hemiptera, and the publically available R workflow used for its generation. The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy. Museum collections were a key source for identified specimens, but we also employed high-throughput collection methods that generated large numbers of unidentified specimens. Many of these specimens represented novel BINs that were subsequently identified by taxonomists, adding barcode coverage for additional species. Our analyses based on both approaches includes 94 species not listed in the most recent Canadian checklist, representing a potential 3% increase in the fauna. We discuss the development of our workflow in the context of prior DNA barcode library construction projects, emphasizing the importance of delineating a set of reference specimens to aid investigations in cases of nomenclatural and DNA barcode discordance. The identification for each specimen in the reference set can be annotated on the Barcode of Life Data System (BOLD), allowing experts to highlight questionable identifications; annotations can be added by any registered user of BOLD, and instructions for this are provided.

Highlights

  • In this study, we present a DNA barcode library as a set of publicly available COI-5’ sequences linked to voucher specimens on the Barcode of Life Data System (BOLD) [1] that meet DNA barcode data standards [1], are identified to species listed in a taxonomic catalogue, and specified using a Digital Object Identifier (DOI, http://www.doi.org/)

  • The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy

  • We present a DNA barcode library as a set of publicly available COI-5’ sequences linked to voucher specimens on the Barcode of Life Data System (BOLD) [1] that meet DNA barcode data standards [1], are identified to species listed in a taxonomic catalogue, and specified using a Digital Object Identifier (DOI, http://www.doi.org/)

Read more

Summary

Introduction

We present a DNA barcode library as a set of publicly available COI-5’ sequences linked to voucher specimens on the Barcode of Life Data System (BOLD) [1] that meet DNA barcode data standards [1], are identified to species listed in a taxonomic catalogue, and specified using a Digital Object Identifier (DOI, http://www.doi.org/). DNA barcode libraries make it possible sequences with their source specimens, which have been collected across time, PLOS ONE | DOI:10.1371/journal.pone.0125635. Hemiptera-of-Canada-DNA-BarcodeLibrary-workflow An active version of the R code is available at: https://github.com/RodgerG/Hemipteraof-Canada-DNA-Barcode-Library-workflow.git

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call