A Fast Multiscale Clustering Approach Based on DBSCAN

Runzi Chen,Meishe Liang,Shuliang Zhao

doi:10.1155/2021/4071177

Abstract

Multiscale brings great benefits for people to observe objects or problems from different perspectives. It has practical significance for clustering on multiscale data. At present, there is a lack of research on the clustering of large‐scale data under the premise that clustering results of small‐scale datasets have been obtained. If one does cluster on large‐scale datasets by using traditional methods, two disadvantages are as follows: (1) Clustering results of small‐scale datasets are not utilized. (2) Traditional method will cause more running overhead. Aims at these shortcomings, this paper proposes a multiscale clustering framework based on DBSCAN. This framework uses DBSCAN for clustering small‐scale datasets, then introduces algorithm Scaling‐Up Cluster Centers (SUCC) generating cluster centers of large‐scale datasets by merging clustering results of small‐scale datasets, not mining raw large‐scale datasets. We show experimentally that, compared to traditional algorithm DBACAN and leading algorithms DBSCAN++ and HDBSCAN, SUCC can provide not only competitive performance but reduce computational cost. In addition, under the guidance of experts, the performance of SUCC is more competitive in accuracy.

Highlights

Clustering is one of the vital data mining and machine learning techniques and that aims to group similar objects into the same cluster and separate dissimilar objects into different clusters [1]
We provide a mathematical model, design a novel algorithm named Scaling-Up Cluster Centers (SUCC) from small scale to large scale, which avoids repetitive clustering on raw datasets
Experimental results show that the SUCC is efficient and reduces runtime consumption with competitive accuracy compared to traditional methods and the leading algorithms, which need to deal with raw data that is much more than cluster centers belonged small-scale data in most instances

Summary

Introduction

Clustering is one of the vital data mining and machine learning techniques and that aims to group similar objects into the same cluster and separate dissimilar objects into different clusters [1]. We concentrate on Scaling-Up Cluster Centers (SUCC) from small scale to large scale and avoid repetitive clustering on raw datasets, with competitive efficiency. It is inefficient that clustering at each scale data by using the traditional method, i.e., reclustering, while SUCC can improve efficiency by computing cluster centers belonged small-scale data and obtaining large scale’s clusters. We provide a mathematical model (multiscale clustering framework), design a novel algorithm named Scaling-Up Cluster Centers (SUCC) from small scale to large scale, which avoids repetitive clustering on raw datasets. Experimental results show that the SUCC is efficient and reduces runtime consumption with competitive accuracy compared to traditional methods and the leading algorithms, which need to deal with raw data that is much more than cluster centers belonged small-scale data in most instances.

Related Work

Problem Description

Proposed Framework

Performance Evaluations

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Wireless Communications and Mobile Computing	Publication Date: Jan 1, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Fast Multiscale Clustering Approach Based on DBSCAN

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Wireless Communications and Mobile Computing

Lead the way for us

Similar Papers

A Novel Clustering Method Using Enhanced Grey Wolf Optimizer and MapReduce
Ashish Kumar Tripathi ... Kapil Sharma
Big Data Research | VOL. 14
Ashish Kumar Tripathi, et. al.Ashish Kumar Tripathi ... Kapil Sharma
21 May 2018
Big Data Research | VOL. 14

Parallel particle swarm optimization clustering algorithm based on MapReduce methodology
Ibrahim Aljarah ... Simone A Ludwig
-
Ibrahim Aljarah, et. al.Ibrahim Aljarah ... Simone A Ludwig
01 Nov 2012
01 Nov 2012

Fuzzy Rough C-Mean Based Unsupervised CNN Clustering for Large-Scale Image Data
Saman Riaz ... Ali Arshad
Applied Sciences | VOL. 8
Saman Riaz, et. al.Saman Riaz ... Ali Arshad
10 Oct 2018
Applied Sciences | VOL. 8

Intelligent Security Image Classification on Small Sample Learning
Zixian Chen ... Guisheng Yin
-
Zixian Chen, et. al.Zixian Chen ... Guisheng Yin
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Fast Multiscale Clustering Approach Based on DBSCAN

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Wireless Communications and Mobile Computing