Abstract

BackgroundGenotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data.MethodsWe have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells.ConclusionsWe validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM.AvailabilityThe MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM.

Highlights

  • Chromosomal structural abnormalities leading to copy number changes, including deletions and amplifications, are common in cancer and certain regions are commonly altered, suggesting their role in the pathogenesis of this disease [1,2]

  • Availability: The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med. yale.edu:8080/MixHMM

  • By combining with a novel sample mixing model, we demonstrate that MixHMM can detect the Copy number variation (CNV) states of tumor cells in a heterogeneous sample contaminated with normal cells

Read more

Summary

Introduction

Chromosomal structural abnormalities leading to copy number changes, including deletions and amplifications, are common in cancer and certain regions are commonly altered, suggesting their role in the pathogenesis of this disease [1,2]. Copy number somatic alterations (CNA, referred as CNV here after, as we use the same algorithm for detection) have been reported as an important factor leading to cancer [5]. The aCGH hybridization signal is segmented by chromosomal location [10,11], and changes in intensity over a region reflect changes in copy number. Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call