Abstract

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.

Highlights

  • Natural history collections document the diversity of life on Earth, past and present

  • We explored de novo and reference-based assemblies of the High-Throughput Sequencing (HTS) museum specimens to test for recovery of target regions

  • Among the 15 specimens selected for HTS, total DNA extracted ranged from 9.9 ng to over 3 μg and modal fragment size of specimens ranged from 50 bases to more than 9000 bases (Figs 4 and 5)

Read more

Summary

Introduction

Natural history collections document the diversity of life on Earth, past and present. They are rich sources of biological discoveries, as well as repositories for vouchers documenting the species studied in previous research. Museum specimens, especially those in herbaria and vertebrate collections, are being used in molecular studies [5,6,7]. Many organismal disciplines that rely on natural history collections are transitioning towards a reliance on molecular data, biologists, especially those who work on small organisms such as insects, do not often view museum specimens as a primary source for DNA sequences, perhaps because DNA degradation prevents traditional PCR based methods of sequence recovery [4,8,9,10]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call