Fast Clustering by Affinity Propagation Based on Density Peaks

Yang Li,Leilei Sun,Chonghui Guo

doi:10.1109/access.2020.3012740

Yang Li, Leilei Sun + Show 1 more

Open Access

https://doi.org/10.1109/access.2020.3012740

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 33	License type: CC BY 4.0

Affiliation: Dalian University of Technology, Beihang University

Abstract

Clustering is an important technique in data mining and knowledge discovery. Affinity propagation clustering (AP) and density peaks and distance-based clustering (DDC) are two significant clustering algorithms proposed in 2007 and 2014 respectively. The two clustering algorithms have simple and clear design ideas, and are effective in finding meaningful clustering solutions. They have been widely used in various applications successfully. However, a key disadvantage of AP is its high time complexity, which has become a bottleneck when applying AP for large-scale problems. The core idea of DDC is to construct the decision graph based on the local density and the distance of each data point, and then select the cluster centers, but the selection of the cluster centers is relatively subjective, and sometimes it is difficult to determine a suitable number of cluster centers. Here, we propose a two-stage clustering algorithm, called DDAP, to overcome these shortcomings. First, we select a small number of potential exemplars based on the two quantities of each data point in DDC to greatly compress the scale of the similarity matrix. Then we implement message-passing on the incomplete similarity matrix. In experiments, two synthetic datasets, nine publicly available datasets, and a real-world electronic medical records (EMRs) dataset are used to evaluate the proposed method. The results demonstrate that DDAP can achieve comparable clustering performance with the original AP algorithm, while the computational efficiency improves observably.

Highlights

Clustering is a discovery process that groups a set of data such that the intracluster similarity is maximized and the intercluster similarity is minimized [1]
We propose a two-stage fast Affinity propagation clustering (AP) clustering algorithm DDAP, which can largely improve the efficiency of the AP algorithm while achieving comparable clustering performance with the original AP
The results demonstrate that DDAP can achieve comparable clustering performances with the original AP algorithm, while the computational efficiency improves observably

Summary

Introduction

Clustering is a discovery process that groups a set of data such that the intracluster similarity is maximized and the intercluster similarity is minimized [1]. Clustering is used for two aims: (a) receiving a primary understanding of raw data and (b) reducing the size of a huge amount of raw data [2]. Because of the importance of clustering, a large number of clustering algorithms have been proposed and applied widely in many domains [3], [4]. Affinity propagation clustering (AP) [5] and density peaks and distance-based clustering (DDC) [6] are two significant clustering algorithms proposed in 2007 and 2014 respectively. The implementation of an exemplar-based clustering is to find some representative data points called exemplars as centers and assign the remaining data points to their nearest centers [7].

Objectives

Methods

Findings

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fast Clustering by Affinity Propagation Based on Density Peaks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Automatic Density Peaks Clustering Using DNA Genetic Algorithm Optimized Data Field and Gaussian Process
Wenke Zang ... Xiyu Liu
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 31
Wenke Zang, et. al.Wenke Zang ... Xiyu Liu
09 May 2017
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 31

Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors
Juanying Xie ... Philip W Grant
Information Sciences | VOL. 354
Juanying Xie, et. al.Juanying Xie ... Philip W Grant
12 Mar 2016
Information Sciences | VOL. 354

Shared-nearest-neighbor-based clustering by fast search and find of density peaks
Rui Liu ... Xiaomei Yu
Information Sciences | VOL. 450
Rui Liu, et. al.Rui Liu ... Xiaomei Yu
20 Mar 2018
Information Sciences | VOL. 450

An Affinity Propagation Clustering Method Using Hybrid Kernel Function With LLE
Lin Sun ... Shiguang Zhang
IEEE Access | VOL. 6
Lin Sun, et. al.Lin Sun ... Shiguang Zhang
01 Jan 2018
IEEE Access | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast Clustering by Affinity Propagation Based on Density Peaks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access