Adaptive Graph [formula omitted]-Means

Shenfei Pei,Yuanchen Sun,Feiping Nie,Xudong Jiang,Zengwei Zheng

doi:10.1016/j.patcog.2024.111226

Abstract

Clustering large-scale datasets has received increasing attention recently. However, existing algorithms are still not efficient in scenarios with extremely large number of clusters. To this end, Adaptive Graph K-Means (AGKM) is proposed in this work. Its idea originates from k-means, but it operates on an adaptive k-Nearest Neighbor (k-NN) graph instead of data features. First, AGKM is highly efficient for processing datasets where both the numbers of samples and clusters are very large. Specifically, the time and space complexity are both linear w.r.t the number of samples and, more importantly, independent to the cluster number. Second, AGKM is designed for balanced clusters. This constraint is realized by adding a regularization term in loss function, and a simple modification of the graph in optimization algorithm, which does not increase the computational burden. Last, the indicator and dissimilarity matrices are learned simultaneously, so that the proposed AGKM obtains the final partition directly with higher efficacy and efficiency. Experiments on several datasets validate the advantages of AGKM. In particular, over 29X and 46X speed-ups with respect to k-means are observed on the two large-scale datasets WebFace and CelebA, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adaptive Graph [formula omitted]-Means

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Similar Papers

Shape-Aware Deep Convolutional Neural Network for Vertebrae Segmentation
S M Masudur Rahman Al Arif ... Greg Slabaugh
-
S M Masudur Rahman Al Arif, et. al.S M Masudur Rahman Al Arif ... Greg Slabaugh
01 Jan 2018
01 Jan 2018

An Ensemble Wasserstein Generative Adversarial Network Method for Road Extraction From High Resolution Remote Sensing Images in Rural Areas
Chuan Yang ... Zhenghong Wang
IEEE Access | VOL. 8
Chuan Yang, et. al.Chuan Yang ... Zhenghong Wang
01 Jan 2020
IEEE Access | VOL. 8

A Flexible and Noise-insensitive Sparse Regression Based Feature Selection Algorithm
Xiaobin Zhi ... Yuexuan Wu
-
Xiaobin Zhi, et. al.Xiaobin Zhi ... Yuexuan Wu
01 Mar 2022
01 Mar 2022

Visualizing large-scale high-dimensional data via hierarchical embedding of KNN graphs
Haiyang Zhu ... Wei Chen
Visual Informatics | VOL. 5
Haiyang Zhu, et. al.Haiyang Zhu ... Wei Chen
01 Jun 2021
Visual Informatics | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive Graph [formula omitted]-Means

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition