Abstract

Cancer genomics has been evolving rapidly, fueled by the emergence of numerous studies and public databases through next-generation sequencing technologies. However, the downstream programs used to preprocess and analyze data on somatic mutations are scattered in different tools, most of which require specific input formats. Here, we developed a user-friendly Python toolkit, MutScape, which provides a comprehensive pipeline of filtering, combination, transformation, analysis and visualization for researchers, to easily explore the cohort-based mutational characterization for studying cancer genomics when obtaining somatic mutation data. MutScape not only can preprocess millions of mutation records in a few minutes, but also offers various analyses simultaneously, including driver gene detection, mutational signature, large-scale alteration identification and actionable biomarker annotation. Furthermore, MutScape supports somatic variant data in both variant call format and mutation annotation format, and leverages caller combination strategies to quickly eliminate false positives. With only two simple commands, robust results and publication-quality images are generated automatically. Herein, we demonstrate the ability of MutScape to correctly reproduce known results using breast cancer samples from The Cancer Genome Atlas. More significantly, discovery of novel results in cancer genomic studies is enabled through the advanced features in MutScape. MutScape is freely available on GitHub, at https://github.com/anitalu724/MutScape.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call