Abstract

BackgroundGene expression profiling has benefited medicine by providing clinically relevant insights at the molecular candidate and systems levels. However, to adopt a more ‘precision’ approach that integrates individual variability including ‘omics data into risk assessments, diagnoses, and therapeutic decision making, whole transcriptome expression needs to be interpreted meaningfully for single subjects. We propose an “all-against-one” framework that uses biological replicates in isogenic conditions for testing differentially expressed genes (DEGs) in a single subject (ss) in the absence of an appropriate external reference standard or replicates. To evaluate our proposed “all-against-one” framework, we construct reference standards (RSs) with five conventional replicate-anchored analyses (NOISeq, DEGseq, edgeR, DESeq, DESeq2) and the remainder were treated separately as single-subject sample pairs for ss analyses (without replicates).ResultsEight ss methods (NOISeq, DEGseq, edgeR, mixture model, DESeq, DESeq2, iDEG, and ensemble) for identifying genes with differential expression were compared in Yeast (parental line versus snf2 deletion mutant; n = 42/condition) and a MCF7 breast-cancer cell line (baseline versus stimulated with estradiol; n = 7/condition). Receiver-operator characteristic (ROC) and precision-recall plots were determined for eight ss methods against each of the five RSs in both datasets. Consistent with prior analyses of these data, ~ 50% and ~ 15% DEGs were obtained in Yeast and MCF7 datasets respectively, regardless of the RSs method. NOISeq, edgeR, and DESeq were the most concordant for creating a RS. Single-subject versions of NOISeq, DEGseq, and an ensemble learner achieved the best median ROC-area-under-the-curve to compare two transcriptomes without replicates regardless of the RS method and dataset (> 90% in Yeast, > 0.75 in MCF7). Further, distinct specific single-subject methods perform better according to different proportions of DEGs.ConclusionsThe “all-against-one” framework provides a honest evaluation framework for single-subject DEG studies since these methods are evaluated, by design, against reference standards produced by unrelated DEG methods. The ss-ensemble method was the only one to reliably produce higher accuracies in all conditions tested in this conservative evaluation framework. However, single-subject methods for identifying DEGs from paired samples need improvement, as no method performed with precision> 90% and obtained moderate levels of recall.http://www.lussiergroup.org/publications/EnsembleBiomarker

Highlights

  • Gene expression profiling has benefited medicine by providing clinically relevant insights at the molecular candidate and systems levels

  • The “all-against-one” framework provides a honest evaluation framework for single-subject differentially expressed gene (DEG) studies since these methods are evaluated, by design, against reference standards produced by unrelated DEG methods

  • DESeq2 is not shown in Panel B neither in Fig. 4 nor in Fig. 5 given its failure to produce any predictions at the selected False Discovery Rate (FDR) cutoffs

Read more

Summary

Introduction

Gene expression profiling has benefited medicine by providing clinically relevant insights at the molecular candidate and systems levels. To adapt the tools designed for populations into tools appropriate for individual-level inference requires either the use of replicates (mimicking the style of a model organism experiment and reducing the cross-sample noise to primarily stochastic and technical factors), a priori distribution and parameter assumptions, or data-derived models to create an expected distribution useful for comparison. In practice, it is not cost-effective and often entirely infeasible to obtain replicate samples from clinical procedures. Since DEG analysis methods were validated using replicates [3, 4], there remains a need to learn how well a DEG method designed for identifying differential expression would perform in real-world conditions and when replicates are unavailable (ss-DEG Methods)

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.