Abstract
Whole genome sequencing (WGS) in cancer genomics has become widespread with recent technological innovations, and the amount and types of information obtained from WGS are increasing rapidly. Appropriate interpretation of results is becoming increasingly important in clinical applications. This study aimed to evaluate the accuracy of tumor content estimation and its impact on somatic variant detection, using 100 simulated tumor samples covering 10-100% tumor content constructed from the sequencing data of cell line models. Extensive analysis revealed that the estimation results varied among computational analytical methods. Notably, there was a large discrepancy in low tumor content (≤ 30%). The reproducibility decreased in cases wherein chromosome-scale copy number changes were observed in normal cells. The minimum tumor content required to detect somatic alterations was estimated to be 10-30%. Identification of whole genome doubling was achieved with the lowest tumor content, followed by single nucleotide variation/insertion or deletion, structural variation, and copy number variation. Tumor content had a significantly higher impact on the false negatives than the false positives in variant calls. Results should be interpreted cautiously for samples wherein tumor content is a concern. These results can form the basis of developing important guidelines for evaluating cancer WGS.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.