Abstract
While genome‐wide demethylation of DNA CpG sites is a well‐established early hallmark of cancer, only recently has the presence of allelic methylation patterns associating with pathological gene expression been reported. Such allelic methylation patterns have often been obscured due to statistical averaging procedures applied to bisulfite conversion sequencing (Bis‐Seq) CpG methylation genomic datasets. Here, we present a Python software suite, Pyllelic, that dissects Bis‐Seq genomic datasets at the individual read level, allowing nuanced analyses of methylation patterns of genes, including allelic patterns associated with gene expression. We used this approach to examine the role of allelic methylation in the telomerase reverse transcriptase (TERT) gene, which is pathologically expressed in the majority of human cancers (approximately 80‐90%) and has known allelic expression patterns. Surprisingly, active TERTgene expression in human cancers has been characterized as having a hypermethylated CpG island in the promoter, which is the opposite of general promoter methylation associations with gene expression, including reports of hypomethylation of promoters correlating with overexpression of oncogenes. However, when we used Pyllelic to analyze >600 cancer cell lines from 23 different tissue types, we found that in the proximal region of the TERTpromoter, allele‐specific hypomethylation correlated with TERTexpression. This analysis was performed in cancer cell lines with known activating, monoallelic promoter mutations as well as in wild‐type TERTcancer cell lines. The Pyllelic suite allows both aggregate and individual read‐level access to the underlying methylated reads of genomic datasets, with a data science‐focused implementation to facilitate the path from hypothesis to publishable figures and statistical analyses. Pyllelic thus should serve as an important tool to help identify allelic DNA methylation patterns and associations for a wide variety of genomic regions, with particular relevance for pathological activation in cancers, as well as other tissue or disease methylation analysis.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have