Robust landmark graph-based clustering for high-dimensional data

Ben Yang,Jinghan Wu,Aoran Sun,Naying Gao,Xuetao Zhang

doi:10.1016/j.neucom.2022.05.011

Abstract

High-dimensional data has attracted much attention because it contains more comprehensive information about samples. How to cluster these high-dimensional data has become a crucial topic in unsupervised learning. Existing clustering methods often show limited applicability due to their high computational complexity and low anti-noise ability. To address this issue, we propose a novel robust landmark graph-based clustering algorithm for high-dimensional data (RLGCH), which inherits the advantages of both k-means++ and graph-based clustering by using the results of k-means++ as pseudo labels for landmark graph-based clustering. In particular, RLGCH can achieve more reasonable clustering effectiveness than methods that just operate in the low-dimensional space or the original space since it performs k-means++ in the low-dimensional space and landmark graph-based spectral clustering in the original feature space. To avoid post-processing after optimization, the embedded factor matrix is constrained as an indicator matrix rather than a simple nonnegative matrix. To enhance the clustering robustness, the L2,1-norm is adopted to minimize the error of results between k-means++ and landmark graph-based clustering. To solve the model of RLGCH, we established a novel efficient optimization strategy to obtain all sample categories directly. Combining our clustering model and optimization strategy, the computational complexity is reduced to linear and insensitive to data dimensions. Extensive experiments on seven real-world datasets and sixteen noisy datasets show that compared with other state-of-the-art methods, RLGCH can improve the clustering efficiency and robustness greatly while guaranteeing comparable or even better clustering effectiveness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust landmark graph-based clustering for high-dimensional data

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: May 6, 2022
Citations: 4

Similar Papers

Sparse multi-view image clustering with complete similarity information
Shuaiyong Li ... Sai Zhang
Neurocomputing | VOL. 596
Shuaiyong Li, et. al.Shuaiyong Li ... Sai Zhang
01 Jun 2024
Neurocomputing | VOL. 596

Broad Graph-Based Non-Negative Robust Continuous Clustering
Qiying Feng ... C L Philip Chen
IEEE Access | VOL. 8
Qiying Feng, et. al.Qiying Feng ... C L Philip Chen
01 Jan 2020
IEEE Access | VOL. 8

Fast and robust K-means clustering via feature learning on high-dimensional data
Xiao-Dong Wang ... Rung-Ching Chen
-
Xiao-Dong Wang, et. al.Xiao-Dong Wang ... Rung-Ching Chen
01 Nov 2017
01 Nov 2017

GFEL: Generalized Feature Embedding Learning Using Weighted Instance Matching
Eric Golinko ... Xingquan Zhu
-
Eric Golinko, et. al.Eric Golinko ... Xingquan Zhu
01 Aug 2017
01 Aug 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust landmark graph-based clustering for high-dimensional data

Abstract

Talk to us

Similar Papers

More From: Neurocomputing