Abstract

We developed a tool, implemented in an R package called true and accurate clone generator (TACG), to simulate 'ground truth' and realistic SNP array and single nucleotide variant (SNV) data. We present TACG and use it to assess several different approaches to segmentation of copy number data from SNP arrays, with a particular interest in detecting copy number variations (CNVs) in cancer samples. We demonstrate that DNAcopy, an algorithm using circular binary segmentation, generally performs best, which is in agreement with previous research. We determine the conditions under which it and other methods break down. In particular, we assess how characteristics like clonal heterogeneity, presence of nested CNVs, and the type of aberration affect algorithm accuracy. The simulations we generated proved to be useful in determining not just the comparative overall accuracy of different algorithms, but also in determining how their efficacy is affected by the biological characteristics of samples from which the data was generated.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.