Optimization and expansion of non-negative matrix factorization

Xihui Lin,Paul C Boutros

doi:10.1186/s12859-019-3312-5

Abstract

BackgroundNon-negative matrix factorization (NMF) is a technique widely used in various fields, including artificial intelligence (AI), signal processing and bioinformatics. However existing algorithms and R packages cannot be applied to large matrices due to their slow convergence or to matrices with missing entries. Besides, most NMF research focuses only on blind decompositions: decomposition without utilizing prior knowledge. Finally, the lack of well-validated methodology for choosing the rank hyperparameters also raises concern on derived results.ResultsWe adopt the idea of sequential coordinate-wise descent to NMF to increase the convergence rate. We demonstrate that NMF can handle missing values naturally and this property leads to a novel method to determine the rank hyperparameter. Further, we demonstrate some novel applications of NMF and show how to use masking to inject prior knowledge and desirable properties to achieve a more meaningful decomposition.ConclusionsWe show through complexity analysis and experiments that our implementation converges faster than well-known methods. We also show that using NMF for tumour content deconvolution can achieve results similar to existing methods like ISOpure. Our proposed missing value imputation is more accurate than conventional methods like multiple imputation and comparable to missForest while achieving significantly better computational efficiency. Finally, we argue that the suggested rank tuning method based on missing value imputation is theoretically superior to existing methods. All algorithms are implemented in the R package NNLM, which is freely available on CRAN and Github.

Highlights

Non-negative matrix factorization (NMF) is a technique widely used in various fields, including artificial intelligence (AI), signal processing and bioinformatics
The main difference between NMF and other factorization methods, such as SVD, is the nonnegativity, which allows only additive combinations of intrinsic ‘parts’, i.e. the hidden features. This is demonstrated in [1], where NMF learns parts of faces and a face is naturally repsuggested that the trinucleotide profile of each cancer type is a positive linear combination of these signatures [4]
alternating nonnegative least square (ANLS) is gaining attention resented as an additive linear combination of different parts

Summary

Introduction

Non-negative matrix factorization (NMF) is a technique widely used in various fields, including artificial intelligence (AI), signal processing and bioinformatics. Non-negative matrix factorization (NMF or NNMF) [1] has been widely used as a general method for dimensional related to some biological pathways [2, 3]. The main difference between NMF and other factorization methods, such as SVD, is the nonnegativity, which allows only additive combinations of intrinsic ‘parts’, i.e. the hidden features. This is demonstrated in [1], where NMF learns parts of faces and a face is naturally repsuggested that the trinucleotide profile of each cancer type is a positive linear combination of these signatures [4]. Negative combinations are not as intuitive or natural as positive combinations

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jan 6, 2020
Citations: 69	License type: open-access

R Discovery Prime

R Discovery Prime

Optimization and expansion of non-negative matrix factorization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

What is missing from my missing data plan?
Sharon D Yeatts ... Renée H Martin
Stroke | VOL. 46
Sharon D Yeatts, et. al.Sharon D Yeatts ... Renée H Martin
07 May 2015
Stroke | VOL. 46

Active learning with missing values considering imputation uncertainty
Jongmin Han ... Seokho Kang
Knowledge-Based Systems | VOL. 224
Jongmin Han, et. al.Jongmin Han ... Seokho Kang
26 Apr 2021
Knowledge-Based Systems | VOL. 224

Imputation of missing values of tumour stage in population-based cancer registration
Nora Eisemann ... Alexander Katalinic
BMC Medical Research Methodology | VOL. 11
Nora Eisemann, et. al.Nora Eisemann ... Alexander Katalinic
19 Sep 2011
BMC Medical Research Methodology | VOL. 11

A Comparison of Multiple Imputation Methods for Data with Missing Values
Geeta Chhabra ... Jayanthi Ranjan
Indian Journal of Science and Technology | VOL. 10
Geeta Chhabra, et. al.Geeta Chhabra ... Jayanthi Ranjan
18 May 2017
Indian Journal of Science and Technology | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimization and expansion of non-negative matrix factorization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics