RADseq Libraries Research Articles

Whole-genome amplification by multiple displacement amplification (MDA) is a promising technique to enable the use of samples with only limited amount of DNA for the construction of RAD-seq libraries. Previous work has shown that, when the amount of DNA used in the MDA reaction is large, double-digest RAD-seq (ddRAD) libraries prepared with amplified genomic DNA result in data that are indistinguishable from libraries prepared directly from genomic DNA. Based on this observation, here we evaluate the quality of ddRAD libraries prepared from MDA-amplified genomic DNA when the amount of input genomic DNA and the coverage obtained for samples is variable. By simultaneously preparing libraries for five species of weevils (Coleoptera, Curculionidae), we also evaluate the likelihood that potential contaminants will be encountered in the assembled dataset. Overall, our results indicate that MDA may not be able to rescue all samples with small amounts of DNA, but it does produce ddRAD libraries adequate for studies of phylogeography and population genetics even when conditions are not optimal. We find that MDA makes it harder to predict the number of loci that will be obtained for a given sequencing effort, with some samples behaving like traditional libraries and others yielding fewer loci than expected. This seems to be caused both by stochastic and deterministic effects during amplification. Further, the reduction in loci is stronger in libraries with lower amounts of template DNA for the MDA reaction. Even though a few samples exhibit substantial levels of contamination in raw reads, the effect is very small in the final dataset, suggesting that filters imposed during dataset assembly are important in removing contamination. Importantly, samples with strong signs of contamination and biases in heterozygosity were also those with fewer loci shared in the final dataset, suggesting that stringent filtering of samples with significant amounts of missing data is important when assembling data derived from MDA-amplified genomic DNA. Overall, we find that the combination of MDA and ddRAD results in high-quality datasets for population genetics as long as the sequence data is properly filtered during assembly.

Read full abstract

Second- and third-generation sequencing technologies are driving a revolution in biology and medicine, with ultra-high throughput sequencers now able to produce 200 human genomes every 3 days at a cost of $1000 per genome (Watson, 2014). Meanwhile, in our lab at Edinburgh Genomics and in other labs throughout the World, researchers are generating their first single-molecule reads from a hand-held, USB-powered sequencer as part of Oxford Nanopore’s MinION access programme (MinION Access Programme, 2014 1 ). Whilst the revolution in biology is recognized, the associated revolution in bioinformatics often goes unmentioned. This Frontiers “Research Topic” is about that revolution; it is about data, and data-driven discovery. The sequencers mentioned above, and others from Pacific Biosciences and Ion Torrent, produce either huge amounts of data, data that are very complex, or both. Bioinformaticians throughouttheworldarecreatingnovelpipelines,algorithmsand tools to be able to cope with the huge amount of diverse data types that can be produced. The very first step in many of those pipelines and tools is quality assessment, quality control and artifact removal. These issues all involve data-driven research—what can we learn from the data? What are the data telling us about quality and artifacts? The first group of papers in the research topic deal with quality assessment and reveal pipelines that are in use in sequencing facilities today. The second set of papers deal with applications of sequencing technologies to particular domains, and how we can improve those applications through effective control of quality and artifacts. The final set of papers deal with very specific biological questions, and what we can learn from the raw data to improve our analyses and help us to better answer those questions. A series of bioinformatics pipelines are applied to sequencing data by the data generating facility, and it is important that those who work with sequencing data understand these. Leggett et al. (2013) reveal many of the pipelines and tools used at The Genome Analysis Centre (TGAC), a genomics institute based in Norwich, UK, which has access to every major sequencing platform. Their paper describes every step in the data generation pipeline, from their Laboratory Information Management System (LIMS) to data-specific pipelines for matepair and RAD-Seq libraries. Similarly, Paszkiewicz et al. (2014)

Read full abstract

RADseq Libraries Research Articles

Related Topics

Articles published on RADseq Libraries

On the causes, consequences, and avoidance of PCR duplicates: Towards a theory of library complexity.

The RadOrgMiner pipeline: Automated genotyping of organellar loci from RADseq data

Development of genetic tools for the redbait species Pyura herdmani and P. stolonifera, important bioengineers along African coastlines

Characterization of Single Nucleotide Polymorphism Marker in the Chinese Giant Salamander Andrias davidianus

Pushing the limits of whole genome amplification: successful sequencing of RADseq library from a single microhymenopteran (Chalcidoidea, Trichogramma).

Whole-genome amplification in double-digest RADseq results in adequate libraries but fewer sequenced loci.

Improving DNA quality extracted from fecal samples—a method to improve DNA yield

Identifying homomorphic sex chromosomes from wild-caught adults with limited genomic resources.

RAD sequencing reveals within-generation polygenic selection in response to anthropogenic organic and metal contamination in North Atlantic Eels.

Optimization of multiplexed RADseq libraries using low-cost adaptors.

Trade-offs and utility of alternative RADseq methods: reply to Puritz et al.

Quality assessment and control of high-throughput sequencing data.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

RADseq Libraries Research Articles

Related Topics

Articles published on RADseq Libraries

On the causes, consequences, and avoidance of PCR duplicates: Towards a theory of library complexity.

The RadOrgMiner pipeline: Automated genotyping of organellar loci from RADseq data

Development of genetic tools for the redbait species Pyura herdmani and P. stolonifera, important bioengineers along African coastlines

Characterization of Single Nucleotide Polymorphism Marker in the Chinese Giant Salamander Andrias davidianus

Pushing the limits of whole genome amplification: successful sequencing of RADseq library from a single microhymenopteran (Chalcidoidea, Trichogramma).

Whole-genome amplification in double-digest RADseq results in adequate libraries but fewer sequenced loci.

Improving DNA quality extracted from fecal samples—a method to improve DNA yield

Identifying homomorphic sex chromosomes from wild-caught adults with limited genomic resources.

RAD sequencing reveals within-generation polygenic selection in response to anthropogenic organic and metal contamination in North Atlantic Eels.

Optimization of multiplexed RADseq libraries using low-cost adaptors.

Trade-offs and utility of alternative RADseq methods: reply to Puritz et al.

Quality assessment and control of high-throughput sequencing data.