Abstract
High‐throughput sequencing of amplicons (HTSA) has been proposed as an effective approach to evaluate taxonomic and genetic diversity at the same time. However, there are still uncertainties as to how the results produced by different bioinformatics treatments impact the conclusions drawn on biodiversity and population genetics indices.We evaluated the ability of six bioinformatics pipelines to recover taxonomic and genetic diversity from HTSA data obtained from controlled assemblages. To that end, 20 assemblages were produced using 354 colonies of Botrylloides spp., sampled in the wild in ten marinas around Brittany (France). We used DNA extracted from preservative ethanol (ebDNA) after various time of storage (3, 6, and 12 months), and from a bulk of preserved specimens (bulkDNA). DNA was amplified with primers designed for targeting this ascidian genus. Results obtained from HTSA data were compared with Sanger sequencing on individual zooids (i.e., individual barcoding).Species identification and relative abundance determined with HTSA data from either ebDNA or bulkDNA were similar to those obtained with traditional individual barcoding. However, after 12 months of storage, the correlation between HTSA and individual‐based data was lower than after shorter durations. The six bioinformatics pipelines were able to depict accurately the genetic diversity using standard population genetics indices (HS and FST), despite producing false positives and missing rare haplotypes. However, they did not perform equally and dada2 was the only pipeline able to retrieve all expected haplotypes.This study showed that ebDNA is a nondestructive alternative for both species identification and haplotype recovery, providing storage does not last more than 6 months before DNA extraction. Choosing the bioinformatics pipeline is a matter of compromise, aiming to retrieve all true haplotypes while avoiding false positives. We here recommend to process HTSA data using dada2, including a chimera‐removal step. Even if the possibility to use multiplexed primer sets deserves further investigation to expand the taxonomic coverage in future similar studies, we showed that primers targeting a particular genus allowed to reliably analyze this genus within a complex community.
Highlights
Most biodiversity assessments rely on taxonomic diversity, many other components, potentially uncorrelated, are crucial for an exhaustive biodiversity assessment (Lindegren et al, 2018)
Species identification and relative abundance determined with High-throughput sequencing of amplicons (HTSA) data from either ebDNA or bulkDNA were similar to those obtained with traditional individual barcoding
Previous reports showed that different bioinformatics pipelines may produce divergent results regarding taxonomic diversity, especially for species richness (Calderón-Sanou et al, 2020; Pauvert et al, 2019), but to our knowledge, the consequences of the choice of different algorithms on haplotype recovery, as well as the impact of the resulting false positives and negatives on commonly used population genetics indices, have not been investigated
Summary
Most biodiversity assessments rely on taxonomic diversity, many other components (functional, phylogenetic, genetic...), potentially uncorrelated, are crucial for an exhaustive biodiversity assessment (Lindegren et al, 2018) In this context, high-throughput sequencing (HTS) of mixed DNAs (Makiola et al, 2020) could be an interesting tool as it offers the possibility to analyze simultaneously two biodiversity components (i.e., taxonomic and genetic). Our goal was to recommend a methodology for jointly assessing taxonomic and genetic diversity via HTS on ebDNA To this end, we evaluated the effectiveness of six metabarcoding analysis pipelines, based on either clustering or denoising approaches, to recover COI haplotypes and assess population genetic diversity indices. We examined biofouling communities from marinas which are composed of many nonindigenous species, a major driver of biodiversity loss
Published Version (
Free)
Join us for a 30 min session where you can share your feedback and ask us any queries you have