Generalized Dirichlet Distribution Research Articles

Many traditional database processing schemes are batch-based: they can use all of the available information at once. Their limitations, however, include storage (memory) and computational speed (often slow) in large-scale applications. Another major disadvantage of batch processing is that any small change or update in the database often requires a reevaluation over all the data. This is inefficient and time-consuming, so the approach seems somewhat obsolete in an era of fast computation. Furthermore, the recent decrease in the cost of performing computations online has promoted streaming and online models. In other words, new systems take advantage of the online setting to build models that perform in real time and handle fast computations with real-time updates. Traditional models can no longer scale to very large applications, so much support has been given to the online framework, because available storage cannot keep up with these massive, nonstationary data. In generative models, a lack of flexible priors and, sometimes, the high complexity of the methods have often hindered performance. Most importantly, many online models still use traditional inference approaches such as variational Bayes (VB) and Markov chain Monte Carlo (MCMC), which individually are not flexible enough because they suffer in either accuracy or efficiency. As a result, we propose in this paper a new model that operates in an online fashion with a Beta-Liouville (BL) prior, chosen for its flexibility in topic correlation analysis. Carrying very few parameters (compared to the generalized Dirichlet distribution, for instance), the BL is coupled with a robust, stochastic generative process within a new hybrid inference scheme that combines the advantages of VB and Gibbs sampling in the collapsed space. This ensures efficient, fast, and accurate processing. Experimental results with nonstationary datasets for face detection, image classification, and text document processing show the merits of the new stochastic approach.
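The contrast between batch reevaluation and online updating described above can be illustrated with a deliberately simple statistic. The sketch below is not the paper's model: it tracks only the running mean of a stream of proportion vectors, and the names `batch_mean`, `OnlineMean`, and `stream` are hypothetical. It shows that an online update touches only the newest observation, whereas the batch version must revisit all stored data.

```python
from typing import List, Sequence


def batch_mean(data: Sequence[Sequence[float]]) -> List[float]:
    """Recompute the mean from scratch using every stored observation."""
    n = len(data)
    dim = len(data[0])
    return [sum(row[j] for row in data) / n for j in range(dim)]


class OnlineMean:
    """Running mean: each update costs O(dim) and needs no stored data."""

    def __init__(self, dim: int) -> None:
        self.count = 0
        self.mean = [0.0] * dim

    def update(self, x: Sequence[float]) -> None:
        self.count += 1
        step = 1.0 / self.count  # a 1/t step size reproduces the batch mean
        self.mean = [m + step * (xi - m) for m, xi in zip(self.mean, x)]


# Toy stream of 2-dimensional proportion vectors (illustrative values only).
stream = [[0.2, 0.8], [0.4, 0.6], [0.9, 0.1]]
online = OnlineMean(dim=2)
for x in stream:
    online.update(x)
```

With a step size of 1/t the online estimate matches the batch result up to rounding. Stochastic inference schemes typically use a more slowly decaying step size so that the estimate can follow nonstationary data, which is an assumption about common practice rather than a detail stated in the abstract.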

This paper addresses the problem of identifying meaningful patterns and trends in data via clustering (i.e., automatically dividing a data set into meaningful homogeneous sub-groups such that the data within the same sub-group are very similar, and data in different sub-groups are very different). The clustering framework that we propose is based on the generalized Dirichlet distribution, which is widely accepted as a flexible modeling approach, and a hierarchical Dirichlet process mixture prior. A main advantage of the adopted hierarchical Dirichlet process is that it provides a principled, elegant nonparametric Bayesian approach to model selection by supposing that the number of mixture components can go to infinity. In addition to capturing the structure of the data, the combination of the hierarchical Dirichlet process and the generalized Dirichlet distribution is shown to offer a natural, efficient solution to the feature selection problem when dealing with high-dimensional data. We develop two variational learning approaches (batch and incremental) for learning the parameters of the proposed model. The batch algorithm examines the entire data set at once, while the incremental one learns the model one step at a time (i.e., it updates the model's parameters each time new data are introduced). The utility of the proposed approach is demonstrated on real applications, namely face detection, facial expression recognition, human gesture recognition, and off-line writer identification. The obtained results clearly show the merits of our statistical framework.
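For readers unfamiliar with the generalized Dirichlet distribution that this framework builds on, the sketch below draws a sample and evaluates the log-density. The abstract does not state the parameterization, so the code assumes the standard Connor-Mosimann form, in which independent Beta(α_d, β_d) variables are combined by stick-breaking. The function names are hypothetical, and none of this reproduces the paper's mixture model or its variational learning.

```python
import math
import random
from typing import List, Sequence


def sample_generalized_dirichlet(
    alpha: Sequence[float], beta: Sequence[float], rng: random.Random
) -> List[float]:
    """Draw x = (x_1, ..., x_D) with x_d > 0 and sum(x) < 1 by stick-breaking."""
    x = []
    remaining = 1.0  # portion of the unit stick not yet allocated
    for a, b in zip(alpha, beta):
        v = rng.betavariate(a, b)  # v_d ~ Beta(alpha_d, beta_d)
        x.append(v * remaining)    # x_d = v_d * prod_{k<d} (1 - v_k)
        remaining *= 1.0 - v
    return x


def log_density(
    x: Sequence[float], alpha: Sequence[float], beta: Sequence[float]
) -> float:
    """Log-density of the generalized Dirichlet in the assumed parameterization."""
    dim = len(x)
    total = 0.0
    cumulative = 0.0
    for d in range(dim):
        a, b = alpha[d], beta[d]
        cumulative += x[d]
        # gamma_d = beta_d - alpha_{d+1} - beta_{d+1}; the last one is beta_D - 1
        if d == dim - 1:
            gamma = b - 1.0
        else:
            gamma = b - alpha[d + 1] - beta[d + 1]
        total += math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
        total += (a - 1.0) * math.log(x[d]) + gamma * math.log(1.0 - cumulative)
    return total


rng = random.Random(0)
draw = sample_generalized_dirichlet([2.0, 3.0, 1.5], [4.0, 2.0, 2.5], rng)
```

When β_d = α_{d+1} + β_{d+1} for every d < D, each γ_d vanishes and the density reduces to an ordinary Dirichlet. This is one way to see that the generalized form has more free parameters and can therefore express a richer covariance structure.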

Articles published on Generalized Dirichlet Distribution

Parallel inference for cross-collection latent generalized Dirichlet allocation model and applications

Online short text clustering using infinite extensions of discrete mixture models

Generalized Dirichlet Distribution Based on Confluent Hypergeometric Series

Decay Branch Ratio Sampling Method with Dirichlet Distribution

Smoothed Generalized Dirichlet: A Novel Count-Data Model for Detecting Emotional States

Accelerating Extreme Search of Multidimensional Functions Based on Natural Gradient Descent with Dirichlet Distributions

Multinomial naïve Bayesian classifier with generalized Dirichlet priors for high-dimensional imbalanced data

A Fractional Generalization of the Dirichlet Distribution and Related Distributions

Stochastic topic models for large scale and nonstationary data

Variational-based latent generalized Dirichlet allocation model in the collapsed space and applications

Data-free metrics for Dirichlet and generalized Dirichlet mixture-based HMMs – A practical study

Small Area Estimation of Proportions with Constraint for National Resources Inventory Survey

Proportional data modeling via entropy-based variational bayes learning of mixture models

Online Learning of Hierarchical Pitman-Yor Process Mixture of Generalized Dirichlet Distributions With Feature Selection.

Expectation propagation learning of a Dirichlet process mixture of Beta-Liouville distributions for proportional data clustering

A hierarchical Dirichlet process mixture of generalized Dirichlet distributions for feature selection

Variational learning of hierarchical infinite generalized Dirichlet mixture models and applications

A variational Bayes model for count data learning and classification

Bayesian nowcasting during the STEC O104:H4 outbreak in Germany, 2011.

Unsupervised clustering and feature weighting based on Generalized Dirichlet mixture modeling
