A simple null model for inferences from network enrichment analysis.

Gustavo S Jeuken, Lukas Käll

doi:10.1371/journal.pone.0206864

Open Access

https://doi.org/10.1371/journal.pone.0206864

Abstract

A prevailing technique for inferring function from lists of identifications from high-throughput molecular biology experiments is over-representation analysis, in which the identifications are compared to predefined sets of related genes, often referred to as pathways. As at least some pathways are known to be incompletely annotated, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that relates more directly to the set of genes being studied, and have designed a dynamic programming algorithm that calculates the distribution of NEA scores, making it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method that carries out the same task. We demonstrate that our method obtains superior statistical calibration compared to the popular NEA inference engine BinoX, while also providing statistics that are easier to interpret.
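The following is a minimal sketch of how a dynamic programming null model of this kind could be computed; it is not the authors' implementation. It rests on assumptions that the abstract does not spell out: that the score of a gene set is the sum of per-gene link counts to the pathway, that the null consists of all gene sets of the same size as the query, and that enrichment is tested in the upper tail. The names `subset_sum_distribution`, `mid_p_value`, and `link_counts` are hypothetical.

```python
from fractions import Fraction
from math import comb


def subset_sum_distribution(link_counts, k):
    """Exact distribution of the summed link count over all size-k subsets.

    link_counts: one non-negative integer per background gene (assumed to be
    that gene's number of network links into the pathway).
    """
    # ways[j][s] = number of size-j subsets seen so far whose counts sum to s
    ways = [dict() for _ in range(k + 1)]
    ways[0][0] = 1
    for c in link_counts:
        # Iterate j downwards so each gene enters a subset at most once.
        for j in range(k - 1, -1, -1):
            for s, n in list(ways[j].items()):
                ways[j + 1][s + c] = ways[j + 1].get(s + c, 0) + n
    total = comb(len(link_counts), k)
    return {s: Fraction(n, total) for s, n in ways[k].items()}


def mid_p_value(distribution, observed):
    """Upper-tail mid p value: P(S > observed) + 0.5 * P(S = observed)."""
    greater = sum(p for s, p in distribution.items() if s > observed)
    equal = distribution.get(observed, Fraction(0))
    return greater + equal / 2


# Toy background of four genes with 0, 1, 2 and 3 links to the pathway.
dist = subset_sum_distribution([0, 1, 2, 3], k=2)
p = mid_p_value(dist, observed=4)
```

Using exact fractions keeps the toy example free of rounding error; a practical version would likely switch to floating point for large backgrounds.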

Highlights

Over-Representation Analysis (ORA) is commonly used to infer function from sets of analytes such as genes, transcripts, proteins or metabolites [1,2,3]
We demonstrate that our method obtains a superior statistical calibration as compared to the popular Network Enrichment Analysis (NEA) inference engine, BinoX, while providing statistics that are easier to interpret
We implemented a Python program that reads network and pathway definition files and scores a query set against a pathway according to Eq (3), using GeneSetDP and GeneSetMC described in the Algorithm section, which enabled us to assign p values according to Eq (2)
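As an illustration of the random sampling idea, the sketch below estimates a mid p value by drawing random gene sets of the same size as the query. It is not GeneSetMC itself. Eq (3) is not reproduced on this page, so the scoring rule used here (the number of network neighbours a query gene has inside the pathway, summed over the query) is an assumption, as are the network representation (a dict mapping each gene to a set of neighbours) and the names `link_count` and `mc_mid_p_value`.

```python
import random


def link_count(gene, pathway, network):
    """Number of network neighbours of `gene` that belong to `pathway`."""
    return len(network.get(gene, set()) & pathway)


def mc_mid_p_value(query, pathway, network, n_samples=10000, seed=0):
    """Monte Carlo estimate of an upper-tail mid p value for a query set."""
    rng = random.Random(seed)
    genes = sorted(network)  # assumed background: every gene in the network
    counts = {g: link_count(g, pathway, network) for g in genes}
    observed = sum(counts.get(g, 0) for g in query)
    greater = equal = 0
    for _ in range(n_samples):
        # Random gene set of the same size as the query, without replacement.
        score = sum(counts[g] for g in rng.sample(genes, len(query)))
        greater += score > observed
        equal += score == observed
    return (greater + 0.5 * equal) / n_samples


# Toy network: a - b - c, plus an isolated gene d.
network = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}, "d": set()}
p_linked = mc_mid_p_value({"a", "c"}, {"b"}, network)
p_empty = mc_mid_p_value({"a", "c"}, set(), network)
```

In the toy network only the pair {a, c} reaches the observed score of 2, so `p_linked` should land near 1/12; with an empty pathway every score ties at zero and the mid p value is exactly 0.5.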

Summary

Introduction

Over-Representation Analysis (ORA) is commonly used to infer function from sets of analytes such as genes, transcripts, proteins or metabolites [1,2,3]. One prominent application of the technique is expression analysis, where ORA is regularly used to assess alterations in pathway activity by examining significantly different concentrations of analytes between biological conditions, such as disease state or treatment group. Most ORA methods assess the overlap between the investigated set of analytes, the query set, and a functional module, the pathway set, using a hypergeometric test or Fisher’s exact test. Variants such as Gene Set Enrichment Analysis (GSEA) [4] include information on the expression levels of the analytes in the query set.
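To make the overlap test concrete, here is a small sketch of an upper-tail hypergeometric p value for the overlap between a query set and a pathway set. The function name `ora_p_value` and its parameter names are hypothetical, and the choice of background size is left to the caller; this is the textbook test rather than any particular tool's implementation.

```python
from math import comb


def ora_p_value(n_background, n_pathway, n_query, n_overlap):
    """P(X >= n_overlap) when n_query genes are drawn without replacement
    from a background of n_background genes, n_pathway of which are in
    the pathway."""
    total = comb(n_background, n_query)
    upper = min(n_pathway, n_query)
    tail = sum(
        comb(n_pathway, x) * comb(n_background - n_pathway, n_query - x)
        for x in range(n_overlap, upper + 1)
    )
    return tail / total


# Background of 10 genes, pathway of 3, query of 3, all 3 overlapping.
p_full_overlap = ora_p_value(10, 3, 3, 3)
```

Here only one of the 120 possible query sets overlaps the pathway completely, so `p_full_overlap` equals 1/120.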

Journal: PLOS ONE
Publication Date: Nov 9, 2018
Citations: 6
License type: CC BY 4.0
