Abstract

Non-negative matrix tri-factorization (NMTF) is a popular technique for learning low-dimensional feature representations of relational data. NMTF learns a representation of a dataset through an optimization procedure that typically uses multiplicative update rules. This procedure has had limited success, and its failure cases have not been well understood. Here we perform an empirical study on six large datasets, comparing multiplicative update rules with three alternative optimization methods: alternating least squares, projected gradients, and coordinate descent. We find that methods based on projected gradients and coordinate descent converge up to twenty-four times faster than multiplicative update rules. Furthermore, the alternating least squares method can quickly train NMTF models on sparse datasets but often fails on dense datasets. Coordinate descent-based NMTF converges up to sixteen times faster than well-established methods.

Highlights

  • Extracting patterns from relational data is a key task in natural language processing [1], bioinformatics [2], and digital humanities [3]

  • We find that the traditional multiplicative update rules method has the worst performance

  • These results indicate that multiplicative update rules, the default non-negative matrix tri-factorization (NMTF) optimization method in many applications, perform substantially worse than the alternative optimization methods described in the present study

Introduction

Extracting patterns from relational data is a key task in natural language processing [1], bioinformatics [2], and digital humanities [3]. We typically represent a relational dataset with a data matrix, encoding, for example, information on document-term frequencies, gene-disease associations, or user-item ratings. Non-negative matrix tri-factorization (NMTF) is a general technique that takes a data matrix and compresses, or embeds, the matrix into a compact latent space. The learned embedding space can be used to identify clusters [4, 5], reveal interesting patterns [6, 7], and generate feature representations for downstream analytics [8, 9]. NMTF has also been used to identify cancer driver genes from patient data [11] and to model topics in text data [12]. Despite numerous applications, training NMTF models on large datasets can be slow and has remained computationally challenging [13].
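To make the setup concrete, the following is a minimal NumPy sketch of NMTF trained with one common form of the multiplicative update rules (the baseline optimization method this study compares against). A data matrix X is approximated as U @ S @ V.T with all three factors non-negative; the function name, random initialization, fixed iteration count, and the `eps` guard are illustrative choices, not the paper's exact implementation.

```python
import numpy as np

def nmtf(X, k1, k2, n_iter=500, eps=1e-9, seed=0):
    """Approximate a non-negative matrix X (n x m) as U @ S @ V.T,
    where U is n x k1, S is k1 x k2, V is m x k2, all non-negative.
    Trained with multiplicative update rules, which rescale each
    factor elementwise and so preserve non-negativity."""
    rng = np.random.default_rng(seed)
    n, m = X.shape
    U = rng.random((n, k1))
    S = rng.random((k1, k2))
    V = rng.random((m, k2))
    for _ in range(n_iter):
        # Each update multiplies the factor by a ratio of the positive and
        # negative parts of the gradient; eps avoids division by zero.
        U *= (X @ V @ S.T) / (U @ S @ V.T @ V @ S.T + eps)
        V *= (X.T @ U @ S) / (V @ S.T @ U.T @ U @ S + eps)
        S *= (U.T @ X @ V) / (U.T @ U @ S @ V.T @ V + eps)
    return U, S, V
```

The alternative methods studied here (alternating least squares, projected gradients, coordinate descent) replace the three update lines in the loop while keeping the same block-wise optimization structure.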

