Generalized empirical Bayesian methods for discovery of differential data in high-throughput biology.

Thomas J Hardcastle

doi:10.1093/bioinformatics/btv569

Abstract

High-throughput data are now commonplace in biological research. Rapidly changing technologies and application mean that novel methods for detecting differential behaviour that account for a 'large P, small n' setting are required at an increasing rate. The development of such methods is, in general, being done on an ad hoc basis, requiring further development cycles and a lack of standardization between analyses. We present here a generalized method for identifying differential behaviour within high-throughput biological data through empirical Bayesian methods. This approach is based on our baySeq algorithm for identification of differential expression in RNA-seq data based on a negative binomial distribution, and in paired data based on a beta-binomial distribution. Here we show how the same empirical Bayesian approach can be applied to any parametric distribution, removing the need for lengthy development of novel methods for differently distributed data. Comparisons with existing methods developed to address specific problems in high-throughput biological data show that these generic methods can achieve equivalent or better performance. A number of enhancements to the basic algorithm are also presented to increase flexibility and reduce computational costs. The methods are implemented in the R baySeq (v2) package, available on Bioconductor http://www.bioconductor.org/packages/release/bioc/html/baySeq.html. tjh48@cam.ac.uk Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generalized empirical Bayesian methods for discovery of differential data in high-throughput biology.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Oct 1, 2015
Citations: 35

Similar Papers

Towards automated derivation of biological pathways using high-throughput biological data
Yu Chen ... Trupti Joshi
-
Yu Chen, et. al. Yu Chen ... Trupti Joshi
10 Mar 2003
10 Mar 2003

DEXUS: identifying differential expression in RNA-Seq studies with unknown conditions
Günter Klambauer ... Sepp Hochreiter
Nucleic Acids Research | VOL. 41
Günter Klambauer, et. al.Günter Klambauer ... Sepp Hochreiter
17 Sep 2013
Nucleic Acids Research | VOL. 41

GENOME-SCALE PROTEIN FUNCTION PREDICTION IN YEAST SACCHAROMYCES CEREVISIAE THROUGH INTEGRATING MULTIPLE SOURCES OF HIGH-THROUGHPUT DATA
Yu Chen ... Dong Xu
-
Yu Chen, et. al.Yu Chen ... Dong Xu
01 Dec 2004
01 Dec 2004

Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.
Yu Chen
Nucleic Acids Research | VOL. 32
Yu ChenYu Chen
29 Nov 2004
Nucleic Acids Research | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generalized empirical Bayesian methods for discovery of differential data in high-throughput biology.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics