Abstract

The reliability of differential gene expression analysis on formalin-fixed, paraffin-embedded (FFPE) expression profiles generated using Affymetrix arrays is questionable, due to the high range of percent-present values reported in studies which profiled FFPE samples using this technology. Moreover, the validity of gene-modules derived from external datasets in FFPE microarray expression profiles is unknown. By generating matched gene expression profiles using RNAs derived from fresh-frozen (FF) and FFPE preserved breast tumors with Affymetrix arrays and FF/FFPE RNA specific amplification-and-labeling kits, the reliability of differential expression analysis and the validity of gene modules derived from external datasets were investigated. Specifically, the reliability of differential expression analysis was investigated by developing de-novo ER/HER2 pathway gene-modules from the matched datasets and validating them on external FF/FFPE gene expression datasets using ROC analysis. Spearman's rank correlation coefficient of module scores between matched FFPE/frozen datasets was used to measure the reliability of gene-modules derived from external datasets in FFPE expression profiles. Independent of the array/amplification-kit/sample preservation method used, de-novo ER/HER2 gene-modules derived from all matched datasets showed similar prediction performance in the independent validation (AUC range in FFPE dataset; ER: 0.93–0.95, HER2: 0.85–0.91), except for the de-novo ER/HER2 gene-module derived from the FFPE dataset using the 3'IVT kit (AUC range in FFPE dataset; ER: 0.79–0.81, HER2: 0.78). Among the external gene modules considered, roughly ~50% gene modules showed high concordance between expression profiles derived from matching FF and FFPE RNA. The remaining discordant gene modules between FF and FFPE expression profiles showed high concordance within matching FF datasets and within matching FFPE datasets independently, implying that microarrays still require improved amplification-and-sample-preparation protocols for deriving 100% concordant expression profiles from matching FF and FFPE RNA.

Highlights

  • Using Affymetrix arrays with formalin-fixed, paraffin-embedded (FFPE) specific sample preparation protocols, many studies reported the feasibility of generating reliable gene expression profiles from FFPE samples [1,2,3,4,5,6,7]

  • The raw expression values from the u133p2.3ivt. ffpe dataset was close to background, which is expected as the polyA-tail dependent 3ivt kit is not ideal for amplifying degraded RNA

  • Studies which profiled FFPE samples reported a high range of percent-present values [1,2,3,4,5,6], reflecting the varying proportion of reliable genes in FFPE expression profiles

Read more

Summary

Introduction

Using Affymetrix arrays with formalin-fixed, paraffin-embedded (FFPE) specific sample preparation protocols, many studies reported the feasibility of generating reliable gene expression profiles from FFPE samples [1,2,3,4,5,6,7]. Though Affymetrix recommends similar percent-present values between samples for reliable analysis [8], most of these studies reported a high range of percent-present values [1,2,3,4,5,6], reflecting the varying proportion of reliable genes in FFPE expression profiles. This high range of percent-present values raises concerns over the reliability of differential expression analysis performed on FFPE expression profiles and the gene signatures or gene-modules developed from them. Using a similar approach (ref-1/ref-2/ ref-3) gene and sample level correlations between matched datasets were investigated

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call