Abstract

BackgroundThe use of environmental DNA for species detection via metabarcoding is growing rapidly. We present a co-designed lab workflow and bioinformatic pipeline to mitigate the 2 most important risks of environmental DNA use: sample contamination and taxonomic misassignment. These risks arise from the need for polymerase chain reaction (PCR) amplification to detect the trace amounts of DNA combined with the necessity of using short target regions due to DNA degradation.FindingsOur high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 PCR replicates and 2 extraction replicates; (ii) using multi-markers (12S,16S,CytB); (iii) a “twin-tagging,” 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases. Because annotation errors in the reference sequences can result in taxonomic misassignment, we supply a protocol for curating sequence datasets. For some taxonomic groups and some markers, curation resulted in >50% of sequences being deleted from public reference databases, owing to (i) limited overlap between our target amplicon and reference sequences, (ii) mislabelling of reference sequences, and (iii) redundancy. Finally, we provide a bioinformatic pipeline to process amplicons and conduct PROTAX assignment and tested it on an invertebrate-derived DNA dataset from 1,532 leeches from Sabah, Malaysia. Twin-tagging allowed us to detect and exclude sequences with non-matching tags. The smallest DNA fragment (16S) amplified most frequently for all samples but was less powerful for discriminating at species rank. Using a stringent and lax acceptance criterion we found 162 (stringent) and 190 (lax) vertebrate detections of 95 (stringent) and 109 (lax) leech samples.ConclusionsOur metabarcoding workflow should help research groups increase the robustness of their results and therefore facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods.

Highlights

  • Monitoring, or even detecting, elusive or cryptic species in the wild can be challenging

  • Our high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 polymerase chain reaction (PCR) replicates and 2 extraction replicates; (ii) using multi-markers (12S, 16S ribosomal RNA (rRNA) (16S), cytochrome b (CytB)); (iii) a “twin-tagging,” 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases

  • Reference database A custom bash script was written to generate a tetrapod reference database for ≤4 mitochondrial markers—a short 93–base pair fragment of 16S rRNA (16S), a 389-bp fragment of 12S rRNA (12S), a 302-bp fragment of cytochrome b (CytB), and a 250bp mitochondrial cytochrome c oxidase subunit I amplicon (COI) that has previously been used in invertebrate-derived DNA” (iDNA) studies [2]

Read more

Summary

Introduction

Monitoring, or even detecting, elusive or cryptic species in the wild can be challenging. In recent years there has been an increase in the availability of cost-effective DNA-based methods made possible by advances in high-throughput DNA sequencing (HTS) One such method is eDNA metabarcoding, which seeks to identify the species present in a habitat from traces of “environmental DNA” (eDNA) in substrates such as water, soil, or faeces. We present a co-designed lab workflow and bioinformatic pipeline to mitigate the 2 most important risks of environmental DNA use: sample contamination and taxonomic misassignment. These risks arise from the need for polymerase chain reaction (PCR) amplification to detect the trace amounts of DNA combined with the necessity of using short target regions due to DNA degradation. Conclusions: Our metabarcoding workflow should help research groups increase the robustness of their results and facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call