Large-Scale Spectral Clustering Based on Representative Points

Libo Yang, Feiping Nie, Xuemei Liu, Mingtang Liu

doi:10.1155/2019/5864020

Abstract

Spectral clustering (SC) has attracted increasing attention due to its effectiveness in machine learning. However, most traditional spectral clustering methods are difficult to apply to large-scale problems, mainly because of their high computational complexity of O(n^3), where n is the number of samples. To achieve fast spectral clustering, we propose a novel approach, called representative point-based spectral clustering (RPSC), to deal efficiently with the large-scale spectral clustering problem. The proposed method first generates two layers of representative points successively using BKHK (balanced k-means-based hierarchical k-means). It then constructs the hierarchical bipartite graph and performs spectral analysis on the graph. Specifically, we construct the similarity matrix using the parameter-free neighbor assignment method, which avoids the need to tune extra parameters. Furthermore, we perform coclustering on the final similarity matrix. The coclustering mechanism takes advantage of the cooccurring cluster structure among the representative points and the original data to strengthen the clustering performance. As a result, the computational complexity can be significantly reduced and the clustering accuracy can be improved. Extensive experiments on several large-scale data sets show the effectiveness, efficiency, and stability of the proposed method.

Highlights

Clustering is one of the fundamental topics in unsupervised learning.
Zhao et al. proposed spectral clustering based on iterative optimization (SCIO), which solves the spectral decomposition problem for large-scale and high-dimensional data sets and also works on multitask clustering [19]. Nonnegative matrix factorization (NMF) has been proposed as a relaxation technique for clustering with excellent performance [20, 21].
We proposed a novel representative point-based spectral clustering approach, named RPSC, based on the two-layer bipartite graph.

Summary

Introduction

Clustering is one of the fundamental topics in unsupervised learning. It has been widely and successfully applied in data mining, pattern recognition, and many other fields. Traditional spectral clustering needs two independent steps: constructing the similarity graph and performing spectral analysis [12]. Both steps are computationally expensive for large-scale data, with computational complexity of O(n^2) and O(n^3), respectively. Zhao et al. proposed spectral clustering based on iterative optimization (SCIO), which solves the spectral decomposition problem for large-scale and high-dimensional data sets and also works on multitask clustering [19]. Liu et al. [28] proposed an efficient clustering algorithm for large-scale graph data using spectral methods. The methods mentioned above adopt a representative point-based strategy to construct the similarity graph and thereby accelerate spectral clustering. A novel and efficient representative point-based spectral clustering method is proposed to deal with large-scale data sets. We can obtain the first-layer representative points by performing the above process iteratively. Then the procedure is repeated on the first-layer representative points to generate the second-layer representative points.
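
The two-layer generation of representative points described above can be illustrated with a rough sketch. It is not the authors' BKHK implementation: it assumes that each split is a balanced 2-means step solved by sorting points on the difference of their distances to two centers, and the function names `balanced_two_means` and `hierarchical_representatives`, the depths, and the random data are all hypothetical.

```python
import numpy as np

def balanced_two_means(X, n_iter=10, seed=0):
    """Split the rows of X into two groups of (nearly) equal size."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=2, replace=False)]
    half = len(X) // 2
    for _ in range(n_iter):
        d0 = ((X - centers[0]) ** 2).sum(axis=1)
        d1 = ((X - centers[1]) ** 2).sum(axis=1)
        # Points that prefer center 0 most strongly come first.
        order = np.argsort(d0 - d1)
        left, right = order[:half], order[half:]
        centers = np.stack([X[left].mean(axis=0), X[right].mean(axis=0)])
    return left, right

def hierarchical_representatives(X, depth, seed=0):
    """Return 2**depth group means obtained by recursive balanced bisection."""
    groups = [np.arange(len(X))]
    for _ in range(depth):
        next_groups = []
        for idx in groups:
            left, right = balanced_two_means(X[idx], seed=seed)
            next_groups += [idx[left], idx[right]]
        groups = next_groups
    return np.stack([X[idx].mean(axis=0) for idx in groups])

# Hypothetical data: 1024 samples in 5 dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(1024, 5))
first_layer = hierarchical_representatives(X, depth=6)             # 64 points
second_layer = hierarchical_representatives(first_layer, depth=3)  # 8 points
```

Because every split halves a group, the number of representatives is always a power of two in this sketch, and the second layer is produced by running the same routine on the first-layer points.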

Similarity Matrix
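
The abstract states that the similarity matrix is built with a parameter-free neighbor assignment. The sketch below uses one commonly cited closed form for this kind of assignment, in which each sample is connected to its k nearest representative points with weights derived from the (k+1)-th nearest distance. Whether RPSC uses exactly this formula is an assumption, the neighbor count `k` is still chosen by the user here, and `neighbor_assignment` is a hypothetical name.

```python
import numpy as np

def neighbor_assignment(X, anchors, k=5):
    """Sparse similarity between samples and anchors; each row sums to 1.

    Assumes anchors has at least k + 1 rows. The dense (n, m, dim)
    broadcast below is only suitable for small illustrative inputs.
    """
    # Squared Euclidean distances, shape (n, m).
    d = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(axis=2)
    idx = np.argsort(d, axis=1)[:, :k + 1]          # k + 1 nearest anchors
    d_sorted = np.take_along_axis(d, idx, axis=1)   # shape (n, k + 1)
    d_k1 = d_sorted[:, k:k + 1]                     # (k + 1)-th distance
    numer = d_k1 - d_sorted[:, :k]
    denom = k * d_k1 - d_sorted[:, :k].sum(axis=1, keepdims=True)
    Z = np.zeros_like(d)
    np.put_along_axis(Z, idx[:, :k], numer / np.maximum(denom, 1e-12), axis=1)
    return Z

# Hypothetical data: 200 samples and 20 representative points in 3 dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
anchors = rng.normal(size=(20, 3))
Z = neighbor_assignment(X, anchors, k=5)
```

Each row of `Z` has at most k nonzero entries, so the resulting sample-to-representative graph stays sparse even when the number of samples is large.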

Coclustering on Similarity Matrix
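
Coclustering on a bipartite similarity matrix can be sketched with the classical approach of taking a truncated SVD of the degree-normalized matrix and clustering the stacked row and column embeddings together. This is a generic illustration under that assumption, not the paper's exact two-layer procedure; `bipartite_cocluster`, `kmeans_farthest_first`, and the toy matrix are hypothetical.

```python
import numpy as np

def kmeans_farthest_first(E, c, n_iter=50):
    """Plain Lloyd iterations with deterministic farthest-first seeding."""
    centers = [E[0]]
    for _ in range(c - 1):
        d = np.min([((E - ctr) ** 2).sum(axis=1) for ctr in centers], axis=0)
        centers.append(E[np.argmax(d)])
    centers = np.stack(centers)
    for _ in range(n_iter):
        dist = ((E[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(c):
            if np.any(labels == j):  # keep the old center if a cluster empties
                centers[j] = E[labels == j].mean(axis=0)
    return labels

def bipartite_cocluster(Z, c):
    """Jointly cluster the n rows and m columns of a nonnegative n x m matrix."""
    n, m = Z.shape
    d_row = np.maximum(Z.sum(axis=1), 1e-12)
    d_col = np.maximum(Z.sum(axis=0), 1e-12)
    Z_hat = Z / np.sqrt(d_row)[:, None] / np.sqrt(d_col)[None, :]
    U, s, Vt = np.linalg.svd(Z_hat, full_matrices=False)
    emb_rows = U[:, :c] / np.sqrt(d_row)[:, None]
    emb_cols = Vt[:c].T / np.sqrt(d_col)[:, None]
    labels = kmeans_farthest_first(np.vstack([emb_rows, emb_cols]), c)
    return labels[:n], labels[n:]

# Toy matrix: samples 0-2 connect only to representatives 0-1,
# samples 3-5 connect only to representatives 2-3.
Z_toy = np.array([[0.5, 0.5, 0.0, 0.0]] * 3 + [[0.0, 0.0, 0.3, 0.7]] * 3)
row_labels, col_labels = bipartite_cocluster(Z_toy, c=2)
```

The decomposition is applied to an n x m matrix with m much smaller than n, which is where the saving over an eigendecomposition of a full n x n similarity matrix comes from in this kind of scheme.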

Experiments

Conclusions

Journal: Mathematical Problems in Engineering	Publication Date: Dec 9, 2019
Citations: 3	License type: CC BY 4.0

Similar Papers

Divide-and-conquer based large-scale spectral clustering
Hongmin Li ... Tetsuya Sakurai
Neurocomputing | VOL. 501 | 09 Jun 2022

RESKM: A General Framework to Accelerate Large-Scale Spectral Clustering
Geping Yang ... Zhifeng Hao
Pattern Recognition | VOL. 137 | 26 Dec 2022

Sparse-reduced computation for large-scale spectral clustering
Philipp Baumann
01 Dec 2016

Unsupervised classification of polarimetric SAR imagery using large-scale spectral clustering with spatial constraints
H Song ... X Xu
International Journal of Remote Sensing | VOL. 36 | 01 Jun 2015
