DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

Marie Lisandra Zepeda-Mendoza,Kristine Bohmann,M Thomas P Gilbert,Aldo Carmona Baez

doi:10.1186/s13104-016-2064-9

Abstract

Background

DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5′-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way.

Results

We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe.

Conclusions

DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.

Electronic supplementary material

The online version of this article (doi:10.1186/s13104-016-2064-9) contains supplementary material, which is available to authorized users.
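The filtering criteria named in the abstract (minimum length, copy number per PCR replicate, and reproducibility across replicates) can be illustrated with a minimal Python sketch. This is not DAMe's own code: the function name `filter_sequences`, its parameter names, and the input layout (one dictionary of unique sequence to copy number per PCR replicate) are assumptions made for illustration only.

```python
from collections import Counter


def filter_sequences(replicates, min_length=1, min_copies=1, min_replicates=1):
    """Keep sequences that are reproducible across PCR replicates.

    replicates: list of dicts, one per PCR replicate of a single sample,
                each mapping a unique sequence to its copy number.
    A sequence counts as present in a replicate only if it is at least
    min_length long and has at least min_copies copies there. It is kept
    if it is present in at least min_replicates replicates.
    Returns a dict of kept sequence -> summed copy number over the
    replicates in which it passed.
    (Hypothetical helper; names and behavior are assumptions.)
    """
    support = Counter()  # number of replicates in which each sequence passed
    totals = Counter()   # summed copy number over the passing replicates
    for rep in replicates:
        for seq, copies in rep.items():
            if len(seq) >= min_length and copies >= min_copies:
                support[seq] += 1
                totals[seq] += copies
    return {seq: totals[seq] for seq, n in support.items() if n >= min_replicates}


# Toy example: three PCR replicates of one sample.
reps = [
    {"ACGTACGT": 10, "ACGTACGA": 1, "TTTT": 50},
    {"ACGTACGT": 7, "GGGGCCCC": 3},
    {"ACGTACGT": 12, "GGGGCCCC": 1},
]
kept = filter_sequences(reps, min_length=6, min_copies=2, min_replicates=2)
print(kept)
```

In this toy input only `ACGTACGT` passes: `TTTT` is too short, `ACGTACGA` is a singleton, and `GGGGCCCC` reaches two copies in just one replicate. Stricter thresholds remove more erroneous sequences at the risk of discarding rare but genuine ones.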

Highlights

DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers.
DAMe filtering thresholds benchmarking: Dataset 3 was a mock eDNA sample generated from a laboratory-prepared mixture of known species at known and equal DNA concentrations, all of which had been CO1 mini-barcoded prior to the experiment. This kind of mock dataset, in which the amplified sequences are known a priori, is useful for detecting error rates and for evaluating filtering strategies [13]. The dataset was therefore an ideal benchmark for calculating the true positive rate (TPR), true negative rate (TNR), false positive rate (FPR), and false negative rate (FNR) of the filtering step performed by DAMe with filter.py, which classifies sequences as derived from the real sample or as derived from contamination or sequencing/PCR errors.
The results show that applying no filtering at all produces the highest TPR, but also the highest FPR (0.37) and the lowest TNR (0.009) (Table 3).
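As a rough illustration of how such rates can be derived from a mock dataset, the sketch below uses the standard confusion-matrix definitions. The exact definitions and normalization used by the authors are not given in this text and may differ, so the values it prints are not comparable to Table 3. The function name `classification_rates` and its arguments are hypothetical.

```python
def classification_rates(kept, expected, all_sequences):
    """Compute TPR, FNR, FPR and TNR for one filtering run.

    kept:          sequences retained by the filter
    expected:      sequences known a priori to be in the mock sample
    all_sequences: every unique sequence observed before filtering
    Standard definitions are assumed:
      TPR = TP / (TP + FN), FNR = FN / (TP + FN),
      FPR = FP / (FP + TN), TNR = TN / (FP + TN).
    """
    kept, expected, all_sequences = set(kept), set(expected), set(all_sequences)
    tp = len(kept & expected)                    # genuine and retained
    fp = len(kept - expected)                    # erroneous but retained
    fn = len((all_sequences & expected) - kept)  # genuine but discarded
    tn = len(all_sequences - expected - kept)    # erroneous and discarded

    def ratio(num, den):
        return num / den if den else float("nan")

    return {
        "TPR": ratio(tp, tp + fn),
        "FNR": ratio(fn, tp + fn),
        "FPR": ratio(fp, fp + tn),
        "TNR": ratio(tn, fp + tn),
    }


# Toy example: five observed sequences, two of which are genuine.
observed = {"A", "B", "C", "D", "E"}
genuine = {"A", "B"}
no_filter = classification_rates(observed, genuine, observed)
strict = classification_rates({"A", "C"}, genuine, observed)
print(no_filter)
print(strict)
```

With no filtering every genuine sequence is retained, so the TPR is at its maximum, but every erroneous sequence is retained as well. This is the trade-off the benchmark is designed to quantify.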

Journal: BMC Research Notes	Publication Date: May 3, 2016
Citations: 48	License type: cc-by
