Abstract
Non-negative matrix factorization, which decomposes the input non-negative matrix into product of two non-negative matrices, has been widely used in the neuroimaging field due to its flexible interpretability with non-negativity property. Nowadays, especially in the neuroimaging field, it is common to have at least thousands of voxels while the sample size is only hundreds. The non-negative matrix factorization encounters both computational and theoretical challenge with such high-dimensional data, i.e., there is no guarantee for a sparse and part-based representation of data. To this end, we introduce a co-sparse non-negative matrix factorization method to high-dimensional data by simultaneously imposing sparsity in both two decomposed matrices. Instead of adding some sparsity induced penalty such as l1 norm, the proposed method directly controls the number of non-zero elements, which can avoid the bias issues and thus yield more accurate results. We developed an alternative primal-dual active set algorithm to derive the co-sparse estimator in a computationally efficient way. The simulation studies showed that our method achieved better performance than the state-of-art methods in detecting the basis matrix and recovering signals, especially under the high-dimensional scenario. In empirical experiments with two neuroimaging data, the proposed method successfully detected difference between Alzheimer's patients and normal person in several brain regions, which suggests that our method may be a valuable toolbox for neuroimaging studies.
Highlights
High-dimensional data structures have been available and studied in many areas including neuroimaging (Chén et al, 2018), biology (Bühlmann et al, 2014), signal processing (Shuman et al, 2013), and economics (Fan et al, 2011)
We demonstrated the effectiveness of the proposed method in application to two neuroimaging data from Alzheimer’s Disease Neuroimaging Initiative (ADNI)
The primary goal of ADNI was to test whether serial magnetic resonance imaging (MRI), PET, and other biological markers are useful in clinical trials of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD)
Summary
High-dimensional data structures have been available and studied in many areas including neuroimaging (Chén et al, 2018), biology (Bühlmann et al, 2014), signal processing (Shuman et al, 2013), and economics (Fan et al, 2011). It is more reasonable to have an NMF estimate, where the original data matrix X is factorized into product of two non-negative matrices, i.e., the basis matrix W and the coding matrix H (Anderson et al, 2014). The co-sparsity is realized by limiting the total number of non-zero elements in both two matrices to a rather small number, which enables us to resolve the “curse of dimensionality.” This co-sparsity is similar with the work proposed by Bolte et al (2014), where a proximal alternating linearized minimization algorithm is introduced to implement it.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.