Abstract

BackgroundTranscriptional analysis is widely used to study the molecular biology of cancer and hold great biomarker potential for clinical patient stratification. Yet, accurate transcriptional profiling requires RNA of a high quality, which often cannot be retrieved from formalin-fixed, paraffin-embedded (FFPE) tumor tissue that is routinely collected and archived in clinical departments. To overcome this roadblock to clinical testing, we previously developed MethCORR, a method that infers gene expression from DNA methylation data, which is robustly retrieved from FFPE tissue. MethCORR was originally developed for colorectal cancer and with this study, we aim to: (1) extend the MethCORR method to 10 additional cancer types and (2) to illustrate that the inferred gene expression is accurate and clinically informative.ResultsRegression models to infer gene expression information from DNA methylation were developed for ten common cancer types using matched RNA sequencing and DNA methylation profiles (HumanMethylation450 BeadChip) from The Cancer Genome Atlas Project. Robust and accurate gene expression profiles were inferred for all cancer types: on average, the expression of 11,000 genes was modeled with good accuracy and an intra-sample correlation of R2 = 0.90 between inferred and measured gene expression was observed. Molecular pathway analysis and transcriptional subtyping were performed for breast, prostate, and lung cancer samples to illustrate the general usability of the inferred gene expression profiles: overall, a high correlation of r = 0.96 (Pearson) in pathway enrichment scores and a 76% correspondence in molecular subtype calls were observed when using measured and inferred gene expression as input. Finally, inferred expression from FFPE tissue correlated better with RNA sequencing data from matched fresh-frozen tissue than did RNA sequencing data from FFPE tissue (P < 0.0001; Wilcoxon rank-sum test).ConclusionsIn all cancers investigated, MethCORR enabled DNA methylation-based transcriptional analysis, thus enabling future analysis of cancer in situations where high-quality DNA, but not RNA, is available. Here, we provide the framework and resources for MethCORR modeling of ten common cancer types, thereby widely expanding the possibilities for transcriptional studies of archival FFPE material.

Highlights

  • Transcriptional analysis is widely used to study the molecular biology of cancer and hold great biomarker potential for clinical patient stratification

  • This involved application of the following steps in samples used for training (80% of samples for each cancer; Fig. 1a): (1) identification of all genome-wide correlations between gene expression and CpG-site methylation levels using matched RNA sequencing and 450K methylation data

  • For samples with available hormone receptor status on the estrogen receptor (ER), we found that samples clustered together according to receptor (ER) status both in a Principal component analysis (PCA) and in bootstrap hierarchical clustering analysis performed with either RNA expression or inferred RNA expression (iRNA) expression in the The Cancer Genome Atlas project (TCGA) BRCA cohort (Fig. 2a and Additional file 6: Fig. S2b and S2c)

Read more

Summary

Introduction

Transcriptional analysis is widely used to study the molecular biology of cancer and hold great biomarker potential for clinical patient stratification. Accurate transcriptional profiling requires RNA of a high quality, which often cannot be retrieved from formalin-fixed, paraffin-embedded (FFPE) tumor tissue that is routinely collected and archived in clinical departments To overcome this roadblock to clinical testing, we previously developed MethCORR, a method that infers gene expression from DNA methylation data, which is robustly retrieved from FFPE tissue. Transcriptomic data have been widely used to resolve this molecular heterogeneity, e.g., by stratification of tumors into homogenous molecular subtypes [2,3,4,5,6,7] and application of subtype-specific biomarkers [5] This paradigm-changing research was primarily performed using high-quality RNA purified from fresh-frozen tumor samples. These represent a highly desirable resource for retrospective studies of tumor classification and to derive more focused biomarkers such as subtype-specific biomarkers [5, 12]

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call