Abstract

We propose an alternative to the generative classifier, which typically models the class conditionals and class priors separately and then applies Bayes' theorem to compute the posterior distribution of classes given the training set as a decision boundary. Because the support vector machine (SVM) is not a probabilistic framework, it is difficult to implement a discriminative classifier based directly on posterior distributions. Since the SVM lacks a full Bayesian analysis, we propose a hybrid (generative–discriminative) technique in which generative topic features obtained through Bayesian learning are fed to the SVM. The standard latent Dirichlet allocation (LDA) topic model, with its Dirichlet (Dir) priors, can be characterized as a Dir–Dir topic model, reflecting the Dirichlet distributions placed on the document and corpus parameters. Using highly flexible conjugate priors to the multinomial, namely the generalized Dirichlet (GD) and Beta-Liouville (BL) distributions, we define two new topic models: BL–GD and GD–BL. We take advantage of the geometric interpretation of these generative (latent) topic models, which associate a K-dimensional manifold (where K is the number of topics) embedded in a V-dimensional feature space (the word simplex), where V is the vocabulary size. Under this structure, the low-dimensional topic simplex (the subspace) represents each document as a single point on the manifold and associates it with a single probability vector over topics. The SVM, with its kernel trick, then operates on these document probabilities for classification, using the maximum-margin learning approach to form the decision boundary. The key observation is that points (documents) that are close to each other on the manifold should belong to the same class. Experimental results on text documents and images demonstrate the merits of the proposed framework.
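To make the pipeline concrete, the sketch below shows the hybrid generative–discriminative scheme in Python with scikit-learn. Several hedges apply: standard LDA (the Dir–Dir model) stands in for the proposed BL–GD and GD–BL models, since GD and BL priors are not available in off-the-shelf libraries, and the corpus, topic count K = 20, and RBF-kernel SVM hyperparameters are illustrative assumptions rather than the paper's experimental setup.

```python
# Hybrid generative-discriminative pipeline (sketch).
# Standard LDA (Dir-Dir) stands in for the proposed BL-GD / GD-BL models;
# its per-document topic proportions are the generative features for the SVM.
from sklearn.datasets import fetch_20newsgroups
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.svm import SVC
from sklearn.pipeline import Pipeline
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Load a small two-class text corpus (illustrative choice, not the paper's data).
data = fetch_20newsgroups(subset="all",
                          categories=["sci.space", "rec.autos"],
                          remove=("headers", "footers", "quotes"))
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.25, random_state=0)

K = 20  # number of topics: dimensionality of the topic simplex (assumed)

pipeline = Pipeline([
    ("counts", CountVectorizer(max_features=5000, stop_words="english")),
    # Generative step: each document becomes a point on the K-topic simplex,
    # i.e., a probability vector of topic proportions.
    ("lda", LatentDirichletAllocation(n_components=K, random_state=0)),
    # Discriminative step: maximum-margin classification with a kernel
    # operating on the topic proportions.
    ("svm", SVC(kernel="rbf", C=10.0, gamma="scale")),
])

pipeline.fit(X_train, y_train)
print("test accuracy:", accuracy_score(y_test, pipeline.predict(X_test)))
```

An RBF kernel on the topic proportions is used here purely for simplicity; a kernel that respects the simplex geometry (for example, one based on geodesic distance under the Fisher information metric) would match the manifold intuition above more closely.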
