Abstract
In this paper, we develop an unsupervised generative clustering framework that combines the variational information bottleneck and the Gaussian mixture model. Specifically, our approach uses the variational information bottleneck method and models the latent space as a mixture of Gaussians. We derive a bound on the cost function of our model that generalizes the Evidence Lower Bound (ELBO) and provide a variational-inference-type algorithm for computing it. In the algorithm, the coders’ mappings are parametrized using neural networks, and the bound is approximated by Markov sampling and optimized with stochastic gradient descent. Numerical results on real datasets demonstrate the effectiveness of our method.
Highlights
Clustering consists of partitioning a given dataset into various groups based on some similarity metric, such as the Euclidean distance, L1 norm, L2 norm, L∞ norm, the popular logarithmic loss measure, or others.
A key aspect is how to design a latent space that is amenable to accurate, low-complexity unsupervised clustering, i.e., one that preserves only those features of the observed high-dimensional data that are useful for clustering while removing all redundant or non-relevant information.
We provide a general cost function for the unsupervised clustering problem studied here, based on the variational Information Bottleneck (IB) framework, and we show that it generalizes the Evidence Lower Bound (ELBO) developed in [19].
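Since the latent space is modeled as a mixture of Gaussians, cluster assignments can be read off as the posterior responsibilities of the mixture components given a latent embedding. The following is a minimal sketch of that computation under assumed diagonal covariances; the function name `soft_assignments` and the parameter names are illustrative, not taken from the paper.

```python
import numpy as np

def soft_assignments(u, pis, mus, sigmas):
    """Responsibilities q(c|u) ∝ pi_c * N(u; mu_c, diag(sigma_c^2)).

    u: latent embedding, shape (d,)
    pis: mixture weights, shape (K,)
    mus, sigmas: component means and std devs, shape (K, d)
    """
    # Log-density of u under each diagonal-Gaussian component.
    log_probs = -0.5 * np.sum(
        ((u - mus) / sigmas) ** 2 + np.log(2 * np.pi * sigmas ** 2), axis=1
    )
    logits = np.log(pis) + log_probs
    logits -= logits.max()  # subtract max for numerical stability
    w = np.exp(logits)
    return w / w.sum()      # normalize to a probability vector over clusters
```

A point embedded near a component's mean receives most of that component's probability mass, which is what makes the mixture latent space directly usable for clustering.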
Summary
Clustering consists of partitioning a given dataset into various groups (clusters) based on some similarity metric, such as the Euclidean distance, L1 norm, L2 norm, L∞ norm, the popular logarithmic loss measure, or others. A key aspect is how to design a latent space that is amenable to accurate, low-complexity unsupervised clustering, i.e., one that preserves only those features of the observed high-dimensional data that are useful for clustering while removing all redundant or non-relevant information. To achieve high clustering accuracy: (i) we derive a cost function that contains the IB hyperparameter s, which controls the optimal trade-off between the accuracy and the regularization of the model; (ii) we use a lower-bound approximation for the KL term in the cost function that does not depend on the clustering assignment probability (the clustering assignment is usually inaccurate at the beginning of training); and (iii) we tune the hyperparameter s following an annealing approach that improves both the convergence and the accuracy of the proposed algorithm.
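The s-weighted cost in step (i) and the annealing of s in step (iii) can be sketched as follows. The geometric growth schedule and the names `anneal_s`, `s_init`, `s_target`, and `vib_gmm_objective` are illustrative assumptions, not the paper's exact choices.

```python
def anneal_s(epoch, s_init=1e-3, s_target=1.0, rate=1.05):
    # Start with a small s (weak regularization, easier early training)
    # and grow it geometrically each epoch, capped at s_target.
    return min(s_target, s_init * rate ** epoch)

def vib_gmm_objective(recon_ll, kl_term, s):
    # Generalized ELBO to maximize: accuracy (data log-likelihood term)
    # minus the s-weighted regularization (KL) term.
    return recon_ll - s * kl_term
```

Starting with a small s lets the encoder first learn an informative latent representation before the regularizer begins pulling the embeddings toward the Gaussian-mixture prior, which is the intuition behind the annealing in step (iii).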