CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing.

Eric Talevich,Boris C Bastian,A Hunter Shain,Thomas Botton

doi:10.1371/journal.pcbi.1004873

Abstract

Germline copy number variants (CNVs) and somatic copy number alterations (SCNAs) are of significant importance in syndromic conditions and cancer. Massively parallel sequencing is increasingly used to infer copy number information from variations in the read depth in sequencing data. However, this approach has limitations in the case of targeted re-sequencing, which leaves gaps in coverage between the regions chosen for enrichment and introduces biases related to the efficiency of target capture and library preparation. We present a method for copy number detection, implemented in the software package CNVkit, that uses both the targeted reads and the nonspecifically captured off-target reads to infer copy number evenly across the genome. This combination achieves both exon-level resolution in targeted regions and sufficient resolution in the larger intronic and intergenic regions to identify copy number changes. In particular, we successfully inferred copy number at equivalent to 100-kilobase resolution genome-wide from a platform targeting as few as 293 genes. After normalizing read counts to a pooled reference, we evaluated and corrected for three sources of bias that explain most of the extraneous variability in the sequencing read depth: GC content, target footprint size and spacing, and repetitive sequences. We compared the performance of CNVkit to copy number changes identified by array comparative genomic hybridization. We packaged the components of CNVkit so that it is straightforward to use and provides visualizations, detailed reporting of significant features, and export options for integration into existing analysis pipelines. CNVkit is freely available from https://github.com/etal/cnvkit.

Highlights

Copy number changes are a useful diagnostic indicator for many diseases, including cancer
Tools have been developed for copy number analysis of these datasets, as well, including CNVer [6], ExomeCNV [7], exomeCopy [8], CONTRA [9], CoNIFER [10], ExomeDepth [11], VarScan 2 [12], XHMM [13], ngCGH [14], EXCAVATOR [15], CANOES [16], PatternCNV [17], CODEX [18], and recent versions of Control-FREEC [19] and cn.MOPS [20]
We evaluated our method on DNA sequencing data from targeted sequencing of the melanoma cell line C0902 [42] and two sets of samples, referred to here as “Targeted sequencing (TR)” and “exome panel (EX)”, derived from a recent study of advanced melanomas [43]:

Summary

Introduction

Copy number changes are a useful diagnostic indicator for many diseases, including cancer. The on– and off-target read depths are combined, normalized to a reference derived from control samples, corrected for several systematic biases to result in a final table of log2 copy ratios.

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS Computational Biology	Publication Date: Apr 21, 2016
Citations: 1399	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

A three-step workflow procedure for the interpretation of array-based comparative genome hybridization results in patients with idiopathic mental retardation and congenital anomalies
Martin Poot ... Ron Hochstenbach
Genetics in Medicine | VOL. 12
Martin Poot, et. al.Martin Poot ... Ron Hochstenbach
01 Aug 2010
Genetics in Medicine | VOL. 12

Distinguishing Somatic and Germline Copy Number Events in Cancer Patient DNA Hybridized to Whole-Genome SNP Genotyping Arrays
Gavin Ha ... Sohrab Shah
-
Gavin Ha, et. al.Gavin Ha ... Sohrab Shah
01 Jan 2013
01 Jan 2013

High-throughput Biology in the Postgenomic Era
Albert Hsiao ... Michael D Kuo
Journal of Vascular and Interventional Radiology | VOL. 20
Albert Hsiao, et. al.Albert Hsiao ... Michael D Kuo
01 Jul 2009
Journal of Vascular and Interventional Radiology | VOL. 20

Array Comparative Genomic Hybridization for Genetic Evaluation of Fetal Loss Between 10 and 20 Weeks of Gestation
Jennifer E Warren ... David K Turok
Obstetrics & Gynecology | VOL. 114
Jennifer E Warren, et. al.Jennifer E Warren ... David K Turok
01 Nov 2009
Array Comparative Genomic Hybridization for Genetic Evaluation of Fetal Loss Between 10 and 20 Weeks of Gestation
Jennifer E Warren ... David K Turok

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology