Research on Parallel DBSCAN Algorithm Design Based on MapReduce

Yan Xiang Fu,Wei Zhong Zhao,Hui Fang Ma

doi:10.4028/www.scientific.net/amr.301-303.1133

Research on Parallel DBSCAN Algorithm Design Based on MapReduce

Yan Xiang Fu, Wei Zhong Zhao + Show 1 more

https://doi.org/10.4028/www.scientific.net/amr.301-303.1133

Copy DOI

Journal: Advanced materials research	Publication Date: Jul 1, 2011
Citations: 28

Affiliation: Xiangtan University, Northwest Normal University

#Parallel Clustering Algorithm #Parallel DBSCAN + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Data clustering has been received considerable attention in many applications, such as data mining, document retrieval, image segmentation and pattern classification. The enlarging volumes of information emerging by the progress of technology, makes clustering of very large scale of data a challenging task. In order to deal with the problem, more researchers try to design efficient parallel clustering algorithms. In this paper, we propose a parallel DBSCAN clustering algorithm based on Hadoop, which is a simple yet powerful parallel programming platform. The experimental results demonstrate that the proposed algorithm can scale well and efficiently process large datasets on commodity hardware.

Full Text