Summarizing Nonparametric Bayesian Mixture Posteriors—Sliced Optimal Transport Metrics for Gaussian Mixtures

  • TL;DR
  • Abstract
  • Literature Map
  • Similar Papers
TL;DR

This paper introduces a model-agnostic approach for summarizing nonparametric Bayesian mixture posteriors by estimating the mixing measure using sliced Wasserstein distances, including two variants for Gaussian mixtures, enabling inference on mixture density and partition with validated density estimates.

Abstract
Translate article icon Translate Article Star icon

Existing methods to summarize posterior inference for mixture models focus on identifying a point estimate of the implied random partition for clustering and validating it using clustering-based loss functions (Wade and Ghahramani; Dahl, Johnson, and Müller). We propose a novel approach for summarizing posterior inference in nonparametric Bayesian mixture models, prioritizing estimation of the mixing measure as an inference target. Moreover, we propose to validate the estimation of the partition via a new perspective which is based on the implied density estimate. One of the key features is the model-agnostic nature of the approach, which remains valid under arbitrarily complex dependence structures in the underlying sampling model. Using a decision-theoretic framework, the proposed methods identify a point estimate using loss functions defined as discrepancies between mixing measures. Estimating the mixing measure then implies inference on the mixture density and the random partition. To define a discrepancy between mixing measures we exploit the discrete nature of the mixing measure and use a version of sliced Wasserstein distance. We introduce two variants for Gaussian mixtures. The first, mixed sliced Wasserstein, applies generalized geodesic projections on a product of Euclidean space and the manifold of symmetric positive definite matrices. The second, sliced mixture Wasserstein, leverages the linearity of Gaussian mixture measures to define a projection.

Similar Papers
  • Research Article
  • Cite Count Icon 15
  • 10.1016/j.engappai.2020.103781
Spherical data clustering and feature selection through nonparametric Bayesian mixture models with von Mises distributions
  • Jun 29, 2020
  • Engineering Applications of Artificial Intelligence
  • Wentao Fan + 1 more

Spherical data clustering and feature selection through nonparametric Bayesian mixture models with von Mises distributions

  • Book Chapter
  • Cite Count Icon 31
  • 10.1093/acprof:oso/9780199694587.003.0010
Nonparametric Bayesian Networks*
  • Oct 6, 2011
  • Katja Ickstadt + 6 more

A convenient way of modelling complex interactions is by employing graphs or networks which correspond to conditional independence structures in an underlying statistical model. One main class of models in this regard are Bayesian networks, which have the drawback of making parametric assumptions. Bayesian nonparametric mixture models offer a possibility to overcome this limitation, but have hardly been used in combination with networks. This manuscript bridges this gap by introducing nonparametric Bayesian network models. We review (parametric) Bayesian networks, in particular Gaussian Bayesian networks, from a Bayesian perspective as well as nonparametric Bayesian mixture models. Afterwards these two modelling approaches are combined into nonparametric Bayesian networks. The new models are compared both to Gaussian Bayesian networks and to mixture models in a simulation study, where it turns out that the nonparametric network models perform favourably in non‐Gaussian situations. The new models are also applied to an example from systems biology, namely finding modules within the MAPK cascade.

  • Book Chapter
  • Cite Count Icon 1
  • 10.1007/978-3-319-18968-0_8
Clustering and Feature Allocation
  • Jan 1, 2015
  • Peter Müller + 3 more

An important byproduct of inference in discrete mixture models is an implied random partition of experimental units. In fact, such random partitions are the main inference targets for many recently published applications of nonparametric Bayesian discrete mixture models. In this chapter we systematically consider the use of nonparametric Bayesian priors for inference on such random partitions. Many scientific inference problems are formalized as the related, more general problem of feature allocation. That is, inference on possibly overlapping random subsets of experimental units. We introduce some examples from data analysis for bioinformatics data and introduce the Polya urn model, product partition models, model based clustering and the Indian buffet process prior.

  • Conference Article
  • Cite Count Icon 58
  • 10.1109/cvpr.2008.4587370
Incremental learning of nonparametric Bayesian mixture models
  • Jun 1, 2008
  • Ryan Gomes + 2 more

Clustering is a fundamental task in many vision applications. To date, most clustering algorithms work in a batch setting and training examples must be gathered in a large group before learning can begin. Here we explore incremental clustering, in which data can arrive continuously. We present a novel incremental model-based clustering algorithm based on nonparametric Bayesian methods, which we call memory bounded variational Dirichlet process (MB-VDP). The number of clusters are determined flexibly by the data and the approach can be used to automatically discover object categories. The computational requirements required to produce model updates are bounded and do not grow with the amount of data processed. The technique is well suited to very large datasets, and we show that our approach outperforms existing online alternatives for learning nonparametric Bayesian mixture models.

  • Research Article
  • Cite Count Icon 34
  • 10.1016/j.bpj.2016.04.053
Bayesian Statistical Inference in Ion-Channel Models with Exact Missed Event Correction
  • Jul 1, 2016
  • Biophysical Journal
  • Michael Epstein + 3 more

Bayesian Statistical Inference in Ion-Channel Models with Exact Missed Event Correction

  • Research Article
  • Cite Count Icon 12
  • 10.1016/j.csda.2019.02.008
Modal posterior clustering motivated by Hopfield’s network
  • Feb 23, 2019
  • Computational Statistics & Data Analysis
  • Ruth Fuentes-García + 2 more

Modal posterior clustering motivated by Hopfield’s network

  • Research Article
  • 10.14429/dsj.65.8573
Evolution of a Hybrid Model for an Effective Perimeter Security Device
  • Nov 10, 2015
  • Defence Science Journal
  • S Selvakumar + 1 more

Clustering and classification models, or hybrid models are the most widely used models that can handle the diverse nature of NIDS dataset. Dirichlet process clustering technique is a non-parametric Bayesian mixture model that considers the data distribution of the dataset for the formation of distinct clusters. The number of clusters is not known a priori and it differs across different datasets. Determining the number of clusters based on the distribution of data instances can increase the performance of the model. Naive Bayes model, a supervised learning classification technique, maintains a better computational efficiency, by reducing the training time. In this paper, we propose a hybrid model to exploit the positive aspect of proper clustering of data instances and the computational efficiency in building a NIDS. RIPPER algorithm is used to extract rules from the traffic description for updation of the rule database. Experiments were conducted in the KDD CUP’99 and SSENet-2011 datasets to study the performance of the proposed model. Also, a comparison of three hybrid methods with the proposed hybrid model was carried out. The results showed that the proposed hybrid model is superior in building a robust perimeter security device.

  • Research Article
  • Cite Count Icon 30
  • 10.1007/s11222-014-9519-4
An adaptive truncation method for inference in Bayesian nonparametric models
  • Dec 9, 2014
  • Statistics and Computing
  • J E Griffin

Many exact Markov chain Monte Carlo algorithms have been developed for posterior inference in Bayesian nonparametric models which involve infinite-dimensional priors. However, these methods are not generic and special methodology must be developed for different classes of prior or different models. Alternatively, the infinite-dimensional prior can be truncated and standard Markov chain Monte Carlo methods used for inference. However, the error in approximating the infinite-dimensional posterior can be hard to control for many models. This paper describes an adaptive truncation method which allows the level of the truncation to be decided by the algorithm and so can avoid large errors in approximating the posterior. A sequence of truncated priors is constructed which are sampled using Markov chain Monte Carlo methods embedded in a sequential Monte Carlo algorithm. Implementational details for infinite mixture models with stick-breaking priors and normalized random measures with independent increments priors are discussed. The methodology is illustrated on infinite mixture models, a semiparametric linear mixed model and a nonparametric time series model.

  • Research Article
  • Cite Count Icon 4
  • 10.1080/00401706.2019.1635532
Bayesian Nonparametric Joint Mixture Model for Clustering Spatially Correlated Time Series
  • Jul 22, 2019
  • Technometrics
  • Youngmin Lee + 1 more

We develop a Bayesian nonparametric joint mixture model for clustering spatially correlated time series based on both spatial and temporal similarities. In the temporal perspective, the pattern of a time series is flexibly modeled as a mixture of Gaussian processes, with a Dirichlet process (DP) prior over mixture components. In the spatial perspective, the spatial location is incorporated as a feature for clustering, like a time series being incorporated as a feature. Namely, we model the spatial distribution of each cluster as a DP Gaussian mixture density. For the proposed model, the number of clusters does not need to be specified in advance, but rather is automatically determined during the clustering procedure. Moreover, the spatial distribution of each cluster can be flexibly modeled with multiple modes, without determining the number of modes or specifying spatial neighborhood structures in advance. Variational inference is employed for the efficient posterior computation of the proposed model. We validate the proposed model using simulated and real-data examples. Supplementary materials for the article are available online.

  • Research Article
  • Cite Count Icon 7
  • 10.1088/1742-5468/2015/10/p10004
Network structure exploration via Bayesian nonparametric models
  • Oct 1, 2015
  • Journal of Statistical Mechanics: Theory and Experiment
  • Y Chen + 4 more

.Complex networks provide a powerful mathematical representation of complex systems in nature and society. To understand complex networks, it is crucial to explore their internal structures, also called structural regularities. The task of network structure exploration is to determine how many groups there are in a complex network and how to group the nodes of the network. Most existing structure exploration methods need to specify either a group number or a certain type of structure when they are applied to a network. In the real world, however, the group number and also the certain type of structure that a network has are usually unknown in advance. To explore structural regularities in complex networks automatically, without any prior knowledge of the group number or the certain type of structure, we extend a probabilistic mixture model that can handle networks with any type of structure but needs to specify a group number using Bayesian nonparametric theory. We also propose a novel Bayesian nonparametric model, called the Bayesian nonparametric mixture (BNPM) model. Experiments conducted on a large number of networks with different structures show that the BNPM model is able to explore structural regularities in networks automatically with a stable, state-of-the-art performance.

  • Conference Article
  • Cite Count Icon 4
  • 10.21437/interspeech.2017-339
Comparison of Non-Parametric Bayesian Mixture Models for Syllable Clustering and Zero-Resource Speech Processing
  • Aug 20, 2017
  • Shreyas Seshadri + 2 more

Zero-resource speech processing (ZS) systems aim to learn structural representations of speech without access to labeled data. A starting point for these systems is the extraction of syllable tokens utilizing the rhythmic structure of a speech signal. Several recent ZS systems have therefore focused on clustering such syllable tokens into linguistically meaningful units. These systems have so far used heuristically set number of clusters, which can, however, be highly dataset dependent and cannot be optimized in actual unsupervised settings. This paper focuses on improving the flexibility of ZS systems using Bayesian non-parametric (BNP) mixture models that are capable of simultaneously learning the cluster models as well as their number based on the properties of the dataset. We also compare different model design choices, namely priors over the weights and the cluster component models, as the impact of these choices is rarely reported in the previous studies. Experiments are conducted using conversational speech from several languages. The models are first evaluated in a separate syllable clustering task and then as a part of a full ZS system in order to examine the potential of BNP methods and illuminate the relative importance of different model design choices.

  • Discussion
  • Cite Count Icon 1
  • 10.1080/03610918.2019.1710190
Prediction of the nonsampled units in survey design with the finite population using Bayesian nonparametric mixture model
  • Apr 9, 2020
  • Communications in Statistics - Simulation and Computation
  • S Rahnamay Kordasiabi + 1 more

In the sampling approaches framework, the combination of the information on the sizes of the nonsampled units can help to attain better estimators by using semiparametric models. Sometimes, the design variables that have an important role in the sampling mechanism are not available. Hence predictions require to be adapted for the consequence of selection. To infer the population mean in a sample survey, we study Bayesian nonparametric model with Dirichlet process prior by considering the inverse-probability weights as the only available information. Indeed, we present a Bayesian nonparametric mixture of regression models for the survey outcomes with the weights as predictors and impute the nonsampled units. Finally, the model-based estimators that are derived from the Bayesian (parametric and nonparametric) methods are compared with the design-based estimator based on the simulation approaches.

  • Research Article
  • 10.1186/s12859-025-06285-z
Computationally efficient multi-sample flow cytometry data analysis using Gaussian mixture models
  • Oct 23, 2025
  • BMC Bioinformatics
  • Philip Rutten + 4 more

BackgroundAn important challenge in flow cytometry (FCM) data analysis is making comparisons of corresponding cell populations across multiple FCM samples. An interesting solution is creating a statistical mixture model for multiple samples simultaneously, as such a multi-sample model can characterize a heterogeneous set of samples, and facilitates direct comparison of cell populations across the data samples. The multi-sample approach to statistical mixture modeling has been explored in a number of reports, mostly within a Bayesian framework and with high computational complexity. Although these approaches are effective, they are also computationally demanding, and therefore do not relate well to the requirement of scalability, which is essential in the multi-sample setting. This limits their utility in the analysis of large sets of large FCM samples.ResultsWe show that basic Gaussian mixture models can be extended to large data sets consisting of multiple samples, using a computationally efficient implementation of the expectation-maximization algorithm. We show that the multi-sample Gaussian mixture model (MSGMM) is competitive with other models, in both rare cell detection and sample classification accuracy. This allows us to further explore the utility of MSGMMs in the analysis of heterogeneous sets of samples. We demonstrate how simple heuristics on MSGMM model output can directly reveal structural patterns in a collection of FCM samples.ConclusionsWe recover the efficiency and utility of the basic MSGMM which underlies more complex and non-parametric Bayesian hierarchical mixture models. The possibility of fitting GMMs to large sets of FCM samples provides opportunities for the discovery of associations between sample composition and sample meta-data such as treatment responses and clinical outcomes.Supplementary InformationThe online version contains supplementary material available at 10.1186/s12859-025-06285-z.

  • PDF Download Icon
  • Conference Article
  • Cite Count Icon 27
  • 10.18653/v1/e17-1076
Event extraction from Twitter using Non-Parametric Bayesian Mixture Model with Word Embeddings
  • Jan 1, 2017
  • Deyu Zhou + 2 more

To extract structured representations of newsworthy events from Twitter, unsupervised models typically assume that tweets involving the same named entities and expressed using similar words are likely to belong to the same event. Hence, they group tweets into clusters based on the co-occurrence patterns of named entities and topical keywords. However, there are two main limitations. First, they require the number of events to be known beforehand, which is not realistic in practical applications. Second, they don't recognise that the same named entity might be referred to by multiple mentions and tweets using different mentions would be wrongly assigned to different events. To overcome these limitations, we propose a non-parametric Bayesian mixture model with word embeddings for event extraction, in which the number of events can be inferred automatically and the issue of lexical variations for the same named entity can be dealt with properly. Our model has been evaluated on three datasets with sizes ranging between 2,499 and over 60 million tweets. Experimental results show that our model outperforms the baseline approach on all datasets by 5-8% in F-measure.

  • Research Article
  • Cite Count Icon 10
  • 10.1007/s11222-023-10207-5
Modularized Bayesian analyses and cutting feedback in likelihood-free inference
  • Jan 19, 2023
  • Statistics and Computing
  • Atlanta Chakraborty + 4 more

There has been much recent interest in modifying Bayesian inference for misspecified models so that it is useful for specific purposes. One popular modified Bayesian inference method is “cutting feedback” which can be used when the model consists of a number of coupled modules, with only some of the modules being misspecified. Cutting feedback methods represent the full posterior distribution in terms of conditional and sequential components, and then modify some terms in such a representation based on the modular structure for specification or computation of a modified posterior distribution. The main goal of this is to avoid contamination of inferences for parameters of interest by misspecified modules. Computation for cut posterior distributions is challenging, and here we consider cutting feedback for likelihood-free inference based on Gaussian mixture approximations to the joint distribution of parameters and data summary statistics. We exploit the fact that marginal and conditional distributions of a Gaussian mixture are Gaussian mixtures to give explicit approximations to marginal or conditional posterior distributions so that we can easily approximate cut posterior analyses. The mixture approach allows repeated approximation of posterior distributions for different data based on a single mixture fit. This is important for model checks which aid in the decision of whether to “cut”. A semi-modular approach to likelihood-free inference where feedback is partially cut is also developed. The benefits of the method are illustrated on two challenging examples, a collective cell spreading model and a continuous time model for asset returns with jumps.

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant