An improvement method of DBSCAN algorithm on cloud computing

Weipeng Jing,Chuanyu Zhao,Chao Jiang

doi:10.1016/j.procs.2019.01.208

Weipeng Jing, Chuanyu Zhao + Show 1 more

Open Access

https://doi.org/10.1016/j.procs.2019.01.208

Copy DOI

Journal: Procedia Computer Science	Publication Date: Jan 1, 2019
Citations: 10	License type: cc-by-nc-nd

Affiliation: Northeast Forestry University

Abstract

DBSCAN is a density-based data clustering algorithm, in image processing, data mining, machine learning and other fields are widely used. With the increasing of the size of clusters, the parallel DBSCAN algorithm is widely used. in image processing, data mining, machine learning and other fields are widely used. However, we consider current partitioning method of DBSCAN is too simple and steps of GETNEIGHBORS query repeatedly access the data set on spark. So we proposed DBSCAN-PSM which applies new data partitioning and merging method. In the first stage of our method we import the KD-Tree, combine the partitioning and GETNEIGHBORS query, reduce the number of access to the data set and decrease the influence of I/O in the algorithm. In the second stage of our method we use the feature of points in merging so as to avoid the time costing of the global label. Experimental results showed that our new method can improve the parallel efficiency and the clustering algorithm performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An improvement method of DBSCAN algorithm on cloud computing

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Similar Papers

DBSCAN-PSM: an improvement method of DBSCAN algorithm on Spark
Weipeng Jing ... Yiqun Cheng
International Journal of High Performance Computing and Networking | VOL. 13
Weipeng Jing, et. al.Weipeng Jing ... Yiqun Cheng
01 Jan 2019
International Journal of High Performance Computing and Networking | VOL. 13

DBSCAN-PSM: an improvement method of DBSCAN algorithm on Spark
Guangsheng Chen ... Yiqun Cheng
International Journal of High Performance Computing and Networking | VOL. 13
Guangsheng Chen, et. al.Guangsheng Chen ... Yiqun Cheng
01 Jan 2019
International Journal of High Performance Computing and Networking | VOL. 13

A Parallel DBSCAN Algorithm Based on Spark
Guangchun Luo ... Thomas Fairley Gooch
-
Guangchun Luo, et. al.Guangchun Luo ... Thomas Fairley Gooch
01 Oct 2016
01 Oct 2016

Approaches for scaling DBSCAN algorithm to large spatial databases
Aoying Zhou ... Yunfa Hu
Journal of Computer Science and Technology | VOL. 15
Aoying Zhou, et. al.Aoying Zhou ... Yunfa Hu
01 Nov 2000
Journal of Computer Science and Technology | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An improvement method of DBSCAN algorithm on cloud computing

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science