PriSeT

Marie Hoffmann,Knut Reinert,Michael T Monaghan

doi:10.1145/3459930.3469546

Abstract

Motivation: DNA metabarcoding is commonly used to infer the species composition of environmental samples, whereby a short, homologous DNA sequence is amplified and sequenced from all members of the community. Samples can comprise hundreds of organisms that can be closely or very distantly related. DNA metabarcoding combines polymerase chain reaction (PCR) and next-generation sequencing (NGS), and sequences are taxonomically identified based on their match to a reference database. Ideally, each species of interest would have a unique DNA barcode. This short, variable sequence needs to be flanked by conserved regions that can be used as primer-binding sites. PCR primer pairs would amplify a variable barcode in a broad evolutionary range of taxa. To date, no tools exist that computationally search and analyze the effectiveness of new primer pairs for large unaligned sequence data sets. More specifically we solve the following problem: Given a set of reference sequences R = {R1, R2, ..., Rm}, find a primer set P that allows for a high taxonomic coverage. This goal can be achieved by filtering for frequent primers and ranking by coverage or variation, i.e. the number of unique barcodes for further analysis. Here we present the software PriSeT, an offline primer-discovery tool that is capable of processing large libraries and is robust against mislabeled or low-quality references. It avoids the construction of a multisequence alignment of R. Instead, PriSeT uses encodings of frequent k-mers that allow bit-parallel processing and other optimizations. Results: We first evaluated PriSeT on references (mostly 18S rRNA genes) from 19 clades covering eukaryotic organisms that are typical for freshwater plankton samples. PriSeT recovered several published primer sets as well as additional, more chemically suitable primer sets. For these new sets, we compared frequency, taxonomic coverage, and amplicon variation with published primer sets. For 11 clades we found de novo primer pairs that cover more taxa than the published ones, and for six clades de novo primers resulted in greater sequence (i.e., DNA barcode) variation. We also applied PriSeT to SARS-CoV-2 genomes and computed 114 new primer pairs with the additional constraint that the sequences have no co-occurrences in closely related taxa. These primer sets would be suitable for empirical testing. Availability: https://github.com/mariehoffmann/PriSeT Contact: marie.hoffmann@fu-berlin.de

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PriSeT

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

New Insights Into Nematode DNA-metabarcoding as Revealed by the Characterization of Artificial and Spiked Nematode Communities
Lieven Waeyenberge ... Annelies Haegeman
Diversity | VOL. 11
Lieven Waeyenberge, et. al.Lieven Waeyenberge ... Annelies Haegeman
02 Apr 2019
Diversity | VOL. 11

Amplification of 16S rRNA genes from culturable and nonculturable Mollicutes
Sujun Deng ... Chuji Hiruki
Journal of Microbiological Methods | VOL. 14
Sujun Deng, et. al.Sujun Deng ... Chuji Hiruki
01 Sep 1991
Journal of Microbiological Methods | VOL. 14

Detection of Human Papillomaviruses in Cervical Neoplasias Using Multiple Sets of Generic Polymerase Chain Reaction Primers
Satoshi Kado ... Hiroshi Shirasawa
Gynecologic Oncology | VOL. 81
Satoshi Kado, et. al.Satoshi Kado ... Hiroshi Shirasawa
01 Apr 2001
Gynecologic Oncology | VOL. 81

High‐throughput identification of non‐marine Ostracoda from the Tibetan Plateau: Evaluating the success of various primers on sedimentary DNA samples
Paula Echeverría‐Galindo ... Wengang Kang
Environmental DNA | VOL. 3
Paula Echeverría‐Galindo, et. al.Paula Echeverría‐Galindo ... Wengang Kang
31 May 2021
Environmental DNA | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PriSeT

Abstract

Talk to us

Similar Papers