Reconstructing DNA copy number by joint segmentation of multiple sequences

Zhongyang Zhang, Kenneth Lange, Chiara Sabatti

doi:10.1186/1471-2105-13-205

Open Access

https://doi.org/10.1186/1471-2105-13-205

Abstract

Background: Variations in DNA copy number carry information on the modalities of genome evolution and mis-regulation of DNA replication in cancer cells. Their study can help localize tumor suppressor genes, distinguish different populations of cancerous cells, and identify genomic variations responsible for disease phenotypes. A number of different high throughput technologies can be used to identify copy number variable sites, and the literature documents multiple effective algorithms. We focus here on the specific problem of detecting regions where variation in copy number is relatively common in the sample at hand. This problem encompasses the cases of copy number polymorphisms, related samples, technical replicates, and cancerous sub-populations from the same individual.

Results: We present a segmentation method named generalized fused lasso (GFL) to reconstruct copy number variant regions. GFL is based on penalized estimation and is capable of processing multiple signals jointly. Our approach is computationally very attractive and leads to sensitivity and specificity levels comparable to those of state-of-the-art specialized methodologies. We illustrate its applicability with simulated and real data sets.

Conclusions: The flexibility of our framework makes it applicable to data obtained with a wide range of technology. Its versatility and speed make GFL particularly useful in the initial screening stages of large data sets.

Highlights

Variations in DNA copy number carry information on the modalities of genome evolution and mis-regulation of DNA replication in cancer cells
We used the collection of copy number variants (CNVs) observed in HapMap Phase III [5] to compile a list of 426 copy number polymorphisms and assumed that if we identify in our sample a CNV corresponding to one of these regions, we should consider it a true positive
We considered two multiple-sample algorithms: generalized fused lasso (GFL) and MSSCAN [16], both applied to Log R ratio (LRR) with the group structure defined by pedigree membership. (While a trio-mode is available in PennCNV [55], this does not adapt to the structure of our families.) A final qualification is in order

Summary

Introduction

Variations in DNA copy number carry information on the modalities of genome evolution and mis-regulation of DNA replication in cancer cells. Their study can help localize tumor suppressor genes, distinguish different populations of cancerous cells, and identify genomic variations responsible for disease phenotypes. A number of different high throughput technologies can be used to identify copy number variable sites, and the literature documents multiple effective algorithms. We focus here on the specific problem of detecting regions where variation in copy number is relatively common in the sample at hand. This problem encompasses the cases of copy number polymorphisms, related samples, technical replicates, and cancerous sub-populations from the same individual. Existing approaches fall into two groups: one is based on the hidden Markov model (HMM) machinery and explicitly aims to reconstruct the unobservable discrete DNA copy number; the other, which we will generically call “segmentation”, aims at identifying …

Journal: BMC Bioinformatics
Publication Date: Aug 16, 2012
Citations: 58
License type: CC BY
