A two-dimensional pooling strategy for rare variant detection on next-generation sequencing platforms.

Philip C Zuzarte, John D McPherson, Robert E Denroche, Gordon Fehringer, Hagit Katzov-Eckert, Rayjean J Hung

doi:10.1371/journal.pone.0093455

Abstract

We describe a method for pooling and sequencing DNA from a large number of individual samples while preserving information regarding sample identity. DNA from 576 individuals was arranged into four 12-row by 12-column matrices and then pooled by row and by column, resulting in 96 total pools with 12 individuals in each pool. Pooling of DNA was carried out in a two-dimensional fashion, such that DNA from each individual was present in exactly one row pool and exactly one column pool. By considering the variants observed in the rows and columns of a matrix, we are able to trace rare variants back to the specific individuals that carry them. The pooled DNA samples were enriched over a 250 kb region previously identified by GWAS to significantly predispose individuals to lung cancer. All 96 pools (12 row and 12 column pools from each of 4 matrices) were barcoded and sequenced on an Illumina HiSeq 2000 instrument with an average depth of coverage greater than 4,000×. Verification based on Ion PGM sequencing confirmed the presence of 91.4% of confidently classified SNVs assayed. In this way, each individual sample is sequenced in multiple pools, providing more accurate variant calling than a single pool or a multiplexed approach. This provides a powerful method for rare variant detection in regions of interest at a reduced cost to the researcher.
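The row-and-column tracing described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the sample IDs, function names, and the simple set-intersection logic are assumptions made for illustration, and real data would also require per-pool variant calling and error handling. The sketch builds one 12 by 12 matrix, derives its 24 pools, and lists the samples that sit at the intersections of the row and column pools in which a variant is observed.

```python
from itertools import product

N = 12  # rows and columns per matrix, as stated in the abstract


def build_pools(matrix):
    """Return (row_pools, col_pools) as lists of sets of sample IDs."""
    row_pools = [set(row) for row in matrix]
    col_pools = [set(col) for col in zip(*matrix)]
    return row_pools, col_pools


def observed_pools(carriers, row_pools, col_pools):
    """Indices of row and column pools that contain at least one carrier."""
    rows = {i for i, pool in enumerate(row_pools) if pool & carriers}
    cols = {j for j, pool in enumerate(col_pools) if pool & carriers}
    return rows, cols


def candidate_carriers(matrix, variant_rows, variant_cols):
    """Samples at every intersection of a variant row pool and a variant column pool."""
    pairs = product(sorted(variant_rows), sorted(variant_cols))
    return [matrix[r][c] for r, c in pairs]


# Hypothetical sample IDs "S000".."S143", laid out row by row.
matrix = [[f"S{r * N + c:03d}" for c in range(N)] for r in range(N)]
row_pools, col_pools = build_pools(matrix)

# One carrier in the matrix: exactly one row pool and one column pool show the variant.
rows, cols = observed_pools({"S043"}, row_pools, col_pools)
single = candidate_carriers(matrix, rows, cols)
print(single)  # ['S043']

# Two carriers in different rows and columns: 2 x 2 intersections, so 4 candidates.
rows, cols = observed_pools({"S043", "S110"}, row_pools, col_pools)
double = candidate_carriers(matrix, rows, cols)
print(double)
```

With a single carrier per matrix the intersection is unique, which is why the design suits rare variants. When two carriers fall in different rows and columns of the same matrix, the intersections alone yield four candidates, so additional evidence (for example, follow-up genotyping) would be needed to resolve them in this simplified model.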

Highlights

Genome-wide association studies (GWAS) provide a wealth of information about the genetic basis of disease.
As regions of the genome that are involved in pathogenesis are identified, there is a need for improved fine mapping of genetic variants associated with disease over a large number of individuals.
Targeted enrichment of specific regions of interest prior to pooling can increase the number of samples processed using current sequencing technologies. Bioinformatics tools such as VarScan and CRISP exist for single nucleotide variant (SNV) calling from pooled samples but are not capable of identifying the specific samples in the pool that contributed the variant [1] [2].

Summary

Introduction

Genome-wide association studies (GWAS) provide a wealth of information about the genetic basis of disease. As regions of the genome that are involved in pathogenesis are identified, there is a need for improved fine mapping of genetic variants associated with disease over a large number of individuals. Sample pooling is a frequently applied method for sequencing a large number of samples in order to detect variants. Targeted enrichment of specific regions of interest prior to pooling can increase the number of samples processed using current sequencing technologies. Bioinformatics tools such as VarScan and CRISP exist for single nucleotide variant (SNV) calling from pooled samples but are not capable of identifying the specific samples in the pool that contributed the variant [1] [2]. Improved methods are required to enable some degree of sample deconvolution for DNA that is pooled prior to library preparation for sequencing.

Journal: PLoS ONE
Publication Date: Apr 11, 2014
Citations: 33
License type: CC BY 4.0
