Abstract

Non-negative matrix factorization, which decomposes the input non-negative matrix into product of two non-negative matrices, has been widely used in the neuroimaging field due to its flexible interpretability with non-negativity property. Nowadays, especially in the neuroimaging field, it is common to have at least thousands of voxels while the sample size is only hundreds. The non-negative matrix factorization encounters both computational and theoretical challenge with such high-dimensional data, i.e., there is no guarantee for a sparse and part-based representation of data. To this end, we introduce a co-sparse non-negative matrix factorization method to high-dimensional data by simultaneously imposing sparsity in both two decomposed matrices. Instead of adding some sparsity induced penalty such as l1 norm, the proposed method directly controls the number of non-zero elements, which can avoid the bias issues and thus yield more accurate results. We developed an alternative primal-dual active set algorithm to derive the co-sparse estimator in a computationally efficient way. The simulation studies showed that our method achieved better performance than the state-of-art methods in detecting the basis matrix and recovering signals, especially under the high-dimensional scenario. In empirical experiments with two neuroimaging data, the proposed method successfully detected difference between Alzheimer's patients and normal person in several brain regions, which suggests that our method may be a valuable toolbox for neuroimaging studies.

Highlights

  • High-dimensional data structures have been available and studied in many areas including neuroimaging (Chén et al, 2018), biology (Bühlmann et al, 2014), signal processing (Shuman et al, 2013), and economics (Fan et al, 2011)

  • We demonstrated the effectiveness of the proposed method in application to two neuroimaging data from Alzheimer’s Disease Neuroimaging Initiative (ADNI)

  • The primary goal of ADNI was to test whether serial magnetic resonance imaging (MRI), PET, and other biological markers are useful in clinical trials of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD)

Read more

Summary

INTRODUCTION

High-dimensional data structures have been available and studied in many areas including neuroimaging (Chén et al, 2018), biology (Bühlmann et al, 2014), signal processing (Shuman et al, 2013), and economics (Fan et al, 2011). It is more reasonable to have an NMF estimate, where the original data matrix X is factorized into product of two non-negative matrices, i.e., the basis matrix W and the coding matrix H (Anderson et al, 2014). The co-sparsity is realized by limiting the total number of non-zero elements in both two matrices to a rather small number, which enables us to resolve the “curse of dimensionality.” This co-sparsity is similar with the work proposed by Bolte et al (2014), where a proximal alternating linearized minimization algorithm is introduced to implement it.

Co-sparse Non-negative Matrix Factorization
Algorithm
5: Update active set and inactive set by
Comparison With Existing Methods
SYNTHETIC EXPERIMENT
Simulation I
Simulation II
Simulation III
EMPIRICAL EXPERIMENT
MRI Data
Functional MRI Data
Method
DISCUSSION
DATA AVAILABILITY STATEMENT
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call