A Sequential Algorithm for Fast Fitting of Dirichlet Process Mixture Models

Xiaole Zhang,David J Nott,Christopher Yau,Ajay Jasra

doi:10.1080/10618600.2013.870906

Abstract

In this article, we propose an improvement on the sequential updating and greedy search (SUGS) algorithm for fast fitting of Dirichlet process mixture models. The SUGS algorithm provides a means for very fast approximate Bayesian inference for mixture data which is particularly of use when datasets are so large that many standard Markov chain Monte Carlo (MCMC) algorithms cannot be applied efficiently, or take a prohibitively long time to converge. In particular, these ideas are used to initially interrogate the data, and to refine models such that one can potentially apply exact data analysis later on. SUGS relies upon sequentially allocating data to clusters and proceeding with an update of the posterior on the subsequent allocations and parameters which assumes this allocation is correct. Our modification softens this approach, by providing a probability distribution over allocations, with a similar computational cost; this approach has an interpretation as a variational Bayes procedure and hence we term it variational SUGS (VSUGS). It is shown in simulated examples that VSUGS can outperform, in terms of density estimation and classification, a version of the SUGS algorithm in many scenarios. In addition, we present a data analysis for flow cytometry data, and SNP data via a three-class Dirichlet process mixture model, illustrating the apparent improvement over the original SUGS algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Sequential Algorithm for Fast Fitting of Dirichlet Process Mixture Models

Abstract

Talk to us

Similar Papers

More From: Journal of Computational and Graphical Statistics

Lead the way for us

Journal: Journal of Computational and Graphical Statistics	Publication Date: Oct 2, 2014
Citations: 20

Similar Papers

Fast approximate inference for variable selection in Dirichlet process mixtures, with an application to pan-cancer proteomics.
Oliver M Crook ... Laurent Gatto
Statistical Applications in Genetics and Molecular Biology | VOL. 18
Oliver M Crook, et. al.Oliver M Crook ... Laurent Gatto
12 Dec 2019
Statistical Applications in Genetics and Molecular Biology | VOL. 18

Fast Bayesian Inference in Dirichlet Process Mixture Models
Lianming Wang ... David B Dunson
Journal of Computational and Graphical Statistics | VOL. 20
Lianming Wang, et. al.Lianming Wang ... David B Dunson
01 Jan 2010
Journal of Computational and Graphical Statistics | VOL. 20

Markov chain Monte Carlo estimation of a mixture item response theory model
Sun-Joo Cho ... Seock-Ho Kim
Journal of Statistical Computation and Simulation | VOL. 83
Sun-Joo Cho, et. al.Sun-Joo Cho ... Seock-Ho Kim
01 Feb 2013
Journal of Statistical Computation and Simulation | VOL. 83

Sequential Monte Carlo methods for epidemic data

-

18 Jul 2020
18 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Sequential Algorithm for Fast Fitting of Dirichlet Process Mixture Models

Abstract

Talk to us

Similar Papers

More From: Journal of Computational and Graphical Statistics