Abstract

BackgroundTumor purity plays an important role in understanding the pathogenic mechanism of tumors. The purity of tumor samples is highly sensitive to tumor heterogeneity. Due to Intratumoral heterogeneity of genetic and epigenetic data, it is suitable to study the purity of tumors. Among them, there are many purity estimation methods based on copy number variation, gene expression and other data, while few use DNA methylation data and often based on selected information sites. Consequently, how to choose methylation sites as information sites has an important influence on the purity estimation results. At present, the selection of information sites was often based on the differentially methylated sites that only consider the mean signal, without considering other possible signals and the strong correlation among adjacent sites.ResultsConsidering integrating multi-signals and strong correlation among adjacent sites, we propose an approach, PEIS, to estimate the purity of tumor samples by selecting informative differential methylation sites. Application to 12 publicly available tumor datasets, it is shown that PEIS provides accurate results in the estimation of tumor purity which has a high consistency with other existing methods. Also, through comparing the results of different information sites selection methods in the evaluation of tumor purity, it shows the PEIS is superior to other methods.ConclusionsA new method to estimate the purity of tumor samples is proposed. This approach integrates multi-signals of the CpG sites and the correlation between the sites. Experimental analysis shows that this method is in good agreement with other existing methods for estimating tumor purity.

Highlights

  • Tumor purity plays an important role in understanding the pathogenic mechanism of tumors

  • MethylPurify [1] uses EM algorithm to identify information sites and infer tumor purity; InfiniumPurify [5] identifies information sites by rank-sum test and estimates tumor purity which combined with Gaussian kernel density; PAMES [2] utilizes the methylation levels of dozens of highly cloned specific CpG sites to evaluate the purity of tumor samples

  • Tumor purity estimation results from PEIS To illustrate the advantages of PEIS, we applied four methods to our selected and pretreated DNA methylation data (665 samples from 12 tumor types)

Read more

Summary

Introduction

Tumor purity plays an important role in understanding the pathogenic mechanism of tumors. Due to Intratumoral heterogeneity of genetic and epigenetic data, it is suitable to study the purity of tumors. There are many purity estimation methods based on copy number variation, gene expression and other data, while few use DNA methylation data and often based on selected information sites. An important issue in tumor research is that tumor samples during sampling are always mixed with normal cells, which we refer to as “tumor purity”. Because of the significant genetic and epigenetic differences between tumor cells and normal cells, it is feasible to use available high-throughput data to estimate tumor purity. There are many methods to estimate tumor purity using gene expression [6], copy number variation [7] and single nucleotide polymorphism [8] as predictors, but few are based on methylation differences

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call