Probabilistic Non-Negative Matrix Factorization and Its Robust Extensions for Topic Modeling

Minnan Luo, Yi Yang, Qinghua Zheng, Alexander Hauptmann, Feiping Nie, Xiaojun Chang

doi:10.1609/aaai.v31i1.10832

Abstract

Traditional topic models fitted by maximum likelihood estimation inevitably suffer from the assumption that words are conditionally independent given the document's topic distribution. In this paper, we follow the generative procedure of the topic model and learn the topic-word distribution and the topic distribution by directly approximating the word-document co-occurrence matrix with a matrix decomposition technique. The proposed methods are: (1) approximating the normalized document-word conditional distribution with a document probability matrix and a word probability matrix, based on probabilistic non-negative matrix factorization (NMF); (2) since standard NMF is well known to be non-robust to noise and outliers, extending the probabilistic NMF of the topic model to robust versions that use l21-norm and capped l21-norm based loss functions, respectively. The proposed framework inherits the explicit probabilistic meaning of the factors in topic models and at the same time makes the conditional independence assumption on words unnecessary. Straightforward and efficient algorithms are developed to solve the corresponding non-smooth and non-convex problems. Experimental results on several benchmark datasets illustrate the effectiveness and superiority of the proposed methods.
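The difference between the three loss functions named in the abstract can be illustrated with a minimal sketch. It uses the standard definitions of the squared Frobenius, l21-norm, and capped l21-norm losses applied to a residual matrix (data minus reconstruction); it is not the paper's algorithm. The residual values, the cap theta, the function names, and the choice to treat columns as documents are all assumptions made for illustration.

```python
import math

def column_norms(residual):
    """Euclidean norm of each column of a matrix given as a list of rows."""
    n_cols = len(residual[0])
    return [math.sqrt(sum(row[j] ** 2 for row in residual)) for j in range(n_cols)]

def frobenius_sq_loss(residual):
    """Standard NMF loss: sum of squared column norms."""
    return sum(n ** 2 for n in column_norms(residual))

def l21_loss(residual):
    """l21-norm loss: sum of (unsquared) column norms."""
    return sum(column_norms(residual))

def capped_l21_loss(residual, theta):
    """Capped l21-norm loss: each column contributes at most theta."""
    return sum(min(n, theta) for n in column_norms(residual))

# Hypothetical residual (2 words x 3 documents); the third column is an outlier.
residual = [
    [0.03, 0.00, 0.6],
    [0.04, 0.01, 0.8],
]

print(frobenius_sq_loss(residual))      # dominated by the squared outlier norm
print(l21_loss(residual))               # outlier counts only linearly
print(capped_l21_loss(residual, 0.1))   # outlier contribution bounded by theta
```

In this toy residual the outlier column accounts for nearly all of the squared Frobenius loss, a smaller share of the l21 loss, and at most theta of the capped loss, which is the usual motivation for calling the latter two robust.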

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Feb 13, 2017
Citations: 22

Similar Papers

Hybrid Topic Cluster Models for Social Healthcare Data
K Rajendra Prasad ... R M
International Journal of Advanced Computer Science and Applications | VOL. 10
01 Jan 2019

Financial Topic Modeling Based on the BERT-LDA Embedding
Mei Zhou ... Jianwu Lin
25 Jul 2022

Advances in Nonnegative Matrix and Tensor Factorization
A Cichocki ... M Mørup
Computational Intelligence and Neuroscience | VOL. 2008
01 Jan 2008

Interpretable machine learning model using Extreme Gradient Boosting with Nonnegative Matrix Factorization improves the accuracy of arrhythmic risk prediction in Brugada Syndrome
G Tse ... K H C Li
European Heart Journal | VOL. 44
09 Nov 2023
