MetaOmGraph: a workbench for interactive exploratory data analysis of large expression datasets.

Eve Syrkin Wurtele, Karin Dorman, Manhoi Hur, Urminder Singh

doi:10.1093/nar/gkz1209


Open Access

https://doi.org/10.1093/nar/gkz1209


Journal: Nucleic Acids Research
Publication Date: Jan 20, 2020
Citations: 19
License type: CC BY 4.0

Affiliation: Iowa State University

Abstract

The diverse and growing omics data in public domains provide researchers with a tremendous opportunity to extract hidden, as yet undiscovered knowledge. However, the vast majority of archived data remain unused. Here, we present MetaOmGraph (MOG), free, open-source, standalone software for exploratory analysis of massive datasets. Researchers, without coding, can interactively visualize and evaluate data in the context of its metadata, homing in on groups of samples or genes based on attributes such as expression values, statistical associations, metadata terms and ontology annotations. Interaction with data is easy via interactive visualizations such as line charts, box plots, scatter plots, histograms and volcano plots. Statistical analyses include co-expression analysis, differential expression analysis and differential correlation analysis, with significance tests. Researchers can send data subsets to R for additional analyses. Multithreading and indexing enable efficient big data analysis. A researcher can create new MOG projects from any numerical data, or explore an existing MOG project. MOG projects, with history of explorations, can be saved and shared. We illustrate MOG by case studies of large curated datasets from human cancer RNA-Seq, where we identify novel putative biomarker genes in different tumors, and microarray and metabolomics data from Arabidopsis thaliana. MOG executable and code: http://metnetweb.gdcb.iastate.edu/ and https://github.com/urmi-21/MetaOmGraph/.
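As an illustration of what a co-expression analysis computes, the following is a minimal sketch that ranks genes by their Pearson correlation with a query gene across samples. It is not MOG's implementation (MOG is used interactively, without coding). The function names `pearson` and `coexpressed`, the toy expression values, and the 0.8 cutoff are all assumptions made for this example.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # Correlation is undefined for a gene with constant expression.
        return float("nan")
    return sxy / math.sqrt(sxx * syy)


def coexpressed(expr, query, min_abs_r=0.8):
    """Return (gene, r) pairs with |r| >= min_abs_r, strongest first.

    `expr` maps gene name -> expression values, one per sample.
    The 0.8 default is an arbitrary illustrative cutoff.
    """
    target = expr[query]
    hits = []
    for gene, values in expr.items():
        if gene == query:
            continue
        r = pearson(target, values)
        if not math.isnan(r) and abs(r) >= min_abs_r:
            hits.append((gene, r))
    hits.sort(key=lambda pair: abs(pair[1]), reverse=True)
    return hits


# Toy data (hypothetical): five genes measured in four samples.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],  # rises with geneA
    "geneC": [4.0, 3.0, 2.0, 1.0],  # falls as geneA rises
    "geneD": [1.0, 3.0, 1.0, 3.0],  # weakly related to geneA
    "geneE": [5.0, 5.0, 5.0, 5.0],  # constant, so skipped
}

for gene, r in coexpressed(expr, "geneA"):
    print(f"{gene}\t{r:+.3f}")
```

Both positively and negatively correlated genes pass the filter because the cutoff is applied to the absolute value of r. A real analysis would also attach a significance test, as the abstract notes.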

Highlights

Public data repositories store petabytes of raw and processed data produced using microarray [1], RNA-seq [2], and mass spectrometry for small molecules [3] and proteins [4].
Out of the 111 genes we identified as increasing during progression of Kidney renal clear cell carcinoma (KIRC) or Kidney renal papillary cell carcinoma (KIRP), 56 have been described as unfavourable prognostic for renal cancer by The Human Protein Atlas (THPA) (Supplementary Table S30).
Out of the 79 genes we identified as decreasing with cancer progression in KIRC or KIRP, 39 were labeled as prognostic favourable for renal cancer by THPA (Supplementary Table S30).

Summary

Introduction

Public data repositories store petabytes of raw and processed data produced using microarray [1], RNA-seq [2], and mass spectrometry for small molecules [3] and proteins [4]. These data represent multiple species, tissues, genotypes, and conditions; some are the results of groundbreaking research. Integrative analysis of data from multiple studies representing diverse biological conditions is key to fully exploiting these vast data resources for scientific discovery [5, 6]. Such analysis allows efficient reuse and recycling of these available data and their metadata [1, 5, 7, 8].



