Abstract

Clustering is the process of grouping similar data objects into different subsets based on their similarities. Inspired by the concept of the popularity of individuals in a community, we rate the popularity of each sample which reflects the centrality of that sample in the dataset. With the aim of identifying clusters with arbitrary shapes and varying densities, we propose a clustering approach that divides samples into separate population groups. This approach is based on identifying the backbone of data, characterized by a set of popular points surrounded by less popular points. To distinguish poorly separated clusters, a proximity measure is defined based on the popularity of samples. We also use the popularity of samples to assign halo points to clusters and calculate cohesion between clusters. The proposed clustering method can detect arbitrary-shaped clusters with varying densities without requiring to specify the number of clusters. Outliers are also identified according to popularity. We demonstrate the effectiveness of the approach on synthetic and real-world datasets.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.