Abstract

Data clustering, which aims to divide the given samples into several different groups, has drawn much attention in recent years. As a powerful tool, non-negative matrix factorization (NMF) has been applied successfully in clustering tasks. However, there are still two main limitations. First, the original NMF treats equally both noisy and clean data, which leads to high sensitivity to noises and outliers. Second, the performance of graph-based NMFs highly depends on the input graph, that is, if a low-quality graph is constructed to regularize NMF, the clustering results will be bad. To address the above-mentioned problems, we propose a novel robust adaptive graph regularized non-negative matrix factorization (RAGNMF) for data clustering. To be specific, we develop a robust weighted NMF (RWNMF) that can assign small weights to noises and outliers and large weights to clean data. Thus, the robustness of NMF is improved. Moreover, in the process of matrix factorization, metric learning is combined to choose some discriminative features and compute more appropriate distances of samples. Then, an adaptive graph is learned to well regularize the NMF. The experimental results demonstrate that the proposed RAGNMF can achieve better clustering performance then most of the state-of-the-art methods.

Highlights

  • Clustering is one of the most important tasks in the fields of data mining and machine learning [1], [2]

  • 2) The clustering results of 2,1-negative matrix factorization (NMF), 2,0.5-NMF, RMNMF, and the proposed robust weighted NMF (RWNMF), RAGNMF are better than the standard NMF

  • 2,1-NMF and RMNMF use 2,1 norm to improve the robustness of NMF, while the proposed RWNMF and RAGNMF employ the weighted Frobenius norm to decrease the sensitivity to noises and outliers

Read more

Summary

Introduction

Clustering is one of the most important tasks in the fields of data mining and machine learning [1], [2]. The goal of clustering is to partition them into multiple different groups. Plenty of clustering methods have been proposed, such as k-means [3], spectral clustering [4], support vector clustering [5], non-negative matrix factorization (NMF) [6]–[9]. NMF is proposed originally in the influential work [6]. Different from most matrix factorization methods, NMF guarantees the non-negativity on the resulting matrices. NMF has been successfully applied in most fields, e.g., document clustering [10], face recognition [11], hyperspectral unmixing [12]

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.