Bayesian analysis for mixtures of discrete distributions with a non-parametric component

Baba B Alhaji,Hongsheng Dai,Yoshiko Hayashi,Veronica Vinciotti,Andrew Harrison,Berthold Lausen

doi:10.1080/02664763.2015.1100594

Abstract

Bayesian finite mixture modelling is a flexible parametric modelling approach for classification and density fitting. Many areas of application require distinguishing a signal from a noise component. In practice, it is often difficult to justify a specific distribution for the signal component; therefore, the signal distribution is usually further modelled via a mixture of distributions. However, modelling the signal as a mixture of distributions is computationally non-trivial due to the difficulties in justifying the exact number of components to be used and due to the label switching problem. This paper proposes the use of a non-parametric distribution to model the signal component. We consider the case of discrete data and show how this new methodology leads to more accurate parameter estimation and smaller false non-discovery rate. Moreover, it does not incur the label switching problem. We show an application of the method to data generated by ChIP-sequencing experiments.

Highlights

Introduction and motivationFinite mixture modelling can be used to describe data obtained from different populations
In the last two decades, many new methodologies have been proposed for the Bayesian analysis of finite mixture models, such as Diebolt and Robert (1994), West
The existing literature has shown that finite mixture models can be inferred in a simple and effective way in a Bayesian estimation framework, persistent challenges still exist in the diagnostic of Markov Chain Monte Carlo (MCMC) convergence due to the following aspects

Summary

Introduction and motivation

Finite mixture modelling can be used to describe data obtained from different populations. Many authors have devised different methodologies for estimating the number of components in a Bayesian finite mixture models, for example reversible jump MCMC (Richardson and Green, 1997) and Birth and Death MCMC (Stephens, 2000a; Nobile et al, 2007). Another approach to deal with the unknown number of components is to use a mixture of Dirichlet processes (Antoniak, 1974; Escobar and West, 1995), which allows an infinite number of components. This motivates our study, as we discuss in detail in the following subsection

Motivation of the study

The contribution and structure of the paper

The new methodology

The interpretation of the model

Scenario 1

Scenario 2

ChIP-seq data

Discussion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bayesian analysis for mixtures of discrete distributions with a non-parametric component

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Applied Statistics

Lead the way for us

Journal: Journal of Applied Statistics	Publication Date: Oct 29, 2015
License type: cc-by

Similar Papers

Analysis of ChIP-seq Data Via Bayesian Finite Mixture Models with a Non-parametric Component
Baba B Alhaji ... Andrew Harrison
-
Baba B Alhaji, et. al.Baba B Alhaji ... Andrew Harrison
01 Jan 2015
01 Jan 2015

Bayesian mixture model based clustering of replicated microarray data
M Medvedovic ... R.E Bumgarner
Bioinformatics | VOL. 20
M Medvedovic, et. al.M Medvedovic ... R.E Bumgarner
10 Feb 2004
Bioinformatics | VOL. 20

Allocation Variable-Based Probabilistic Algorithm to Deal with Label Switching Problem in Bayesian Mixture Models.
Jia-Chiun Pan ... Cathy W.S Chen
PloS one | VOL. 10
Jia-Chiun Pan, et. al.Jia-Chiun Pan ... Cathy W.S Chen
12 Oct 2015
PloS one | VOL. 10

A Better Alternative to Non-parametric Approaches for Adjusting for Covariate Measurement Errors in Logistic Regression
Shahadut Hossain ... A H M Saidul Hasan
Communications in Statistics - Simulation and Computation | VOL. 45
Shahadut Hossain, et. al.Shahadut Hossain ... A H M Saidul Hasan
15 Sep 2014
Communications in Statistics - Simulation and Computation | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian analysis for mixtures of discrete distributions with a non-parametric component

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Applied Statistics