Snapshot ensembles of non-negative matrix factorization for stability of topic modeling

Jipeng Qiang,Yun Li,Wei Liu,Yunhao Yuan

doi:10.1007/s10489-018-1192-4

Abstract

Recently many topic models such as Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) have made important progress towards generating high-level knowledge from a large corpus. However, these algorithms based on random initialization generate different results on the same corpus using the same parameters, denoted as instability problem. For solving this problem, ensembles of NMF are known to be much more stable and accurate than individual NMFs. However, training multiple NMFs for ensembling is computationally expensive. In this paper, we propose a novel scheme to obtain the seemingly contradictory goal of ensembling multiple NMFs without any additional training cost. We train a single NMF algorithm with the cyclical learning rate schedule, which can converge to several local minima along its optimization path. We save the results to the ensemble when the model converges, and then restart the optimization with a large learning rate that can help escape the current local minimum. Based on experiments performed on text corpora using a number of measures to assess, our method can reduce instability at no additional training cost, while simultaneously yields more accurate topic models than traditional single methods and ensemble methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Snapshot ensembles of non-negative matrix factorization for stability of topic modeling

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence

Lead the way for us

Journal: Applied Intelligence	Publication Date: May 10, 2018
Citations: 14

Similar Papers

Comparative Analysis of Research Papers Categorization using LDA and NMF Approaches
Sandeep Preetham M C ... Darukumalli Sai Tharun Reddy
-
Sandeep Preetham M C, et. al.Sandeep Preetham M C ... Darukumalli Sai Tharun Reddy
20 Nov 2022
20 Nov 2022

Hybrid Topic Cluster Models for Social Healthcare Data
K Rajendra Prasad ... R M
International Journal of Advanced Computer Science and Applications | VOL. 10
K Rajendra Prasad, et. al.K Rajendra Prasad ... R M
01 Jan 2019
International Journal of Advanced Computer Science and Applications | VOL. 10

BERT for Arabic Topic Modeling: An Experimental Study on BERTopic Technique
Abeer Abuzayed ... Hend Al-Khalifa
Procedia Computer Science | VOL. 189
Abeer Abuzayed, et. al.Abeer Abuzayed ... Hend Al-Khalifa
01 Jan 2020
Procedia Computer Science | VOL. 189

Exploring Latent Themes-Analysis of various Topic Modelling Algorithms
Reetesh Kumar Srivastava ... Shalini Sharma
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Reetesh Kumar Srivastava, et. al. Reetesh Kumar Srivastava ... Shalini Sharma
21 Jun 2023
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Snapshot ensembles of non-negative matrix factorization for stability of topic modeling

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence