Comparative Study between Parallel K-Means and Parallel K-Medoids with Message Passing Interface (MPI)

Fhira Nhita

doi:10.21108/ijoict.2016.22.86

Abstract

<p>Data mining combines techniques such as classification and clustering to extract useful information from datasets. Clustering is one of the most widely used data mining techniques today. K-Means and K-Medoids are among the most commonly used clustering algorithms because they are easy to implement, efficient, and produce good results. Besides extracting important information, the time spent mining data is also a concern today, since real-world applications produce huge volumes of data. This research analyzes the clustering results and time performance of the K-Means and K-Medoids algorithms, which were parallelized on a High Performance Computing (HPC) cluster using the Message Passing Interface (MPI) library. The results show that the K-Means algorithm gives a smaller sum of squared errors (SSE) than K-Medoids, and that the parallel algorithms using MPI give faster computation times than the sequential algorithms.</p>
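The SSE comparison in the abstract can be illustrated for a single cluster. The sketch below is not the authors' code: the toy points, the function names, and the use of squared Euclidean distance as the K-Medoids dissimilarity are all assumptions made for illustration. It shows that, for a fixed set of points, the mean gives an SSE no larger than that of any member point chosen as medoid.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def sse(points, center):
    """Sum of squared errors of one cluster around a given center."""
    return sum(squared_distance(p, center) for p in points)


def mean_center(points):
    """Arithmetic mean of the points (the K-Means representative)."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def medoid_center(points):
    """Member point with the smallest total squared distance to the others.

    Assumption: squared Euclidean distance is the dissimilarity measure;
    K-Medoids implementations may use other measures.
    """
    return min(points, key=lambda c: sse(points, c))


# Hypothetical toy cluster, not data from the study.
cluster = [(1.0, 1.0), (2.0, 1.0), (4.0, 3.0), (5.0, 4.0)]
sse_mean = sse(cluster, mean_center(cluster))
sse_medoid = sse(cluster, medoid_center(cluster))
print(sse_mean, sse_medoid)
```

The medoid is restricted to being one of the data points, while the mean is free to sit anywhere, which is one plausible reason a K-Means result can show a smaller SSE on the same data.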

Highlights

Nowadays, data generation is advancing massively and rapidly
A related study by Jing Zhang, Gongqing Wu, Xuegang Hu, Shiying Li, and Shuilang Hao is titled “A Parallel K-means Clustering Algorithm with Message Passing Interface (MPI)” [1]
We propose a High Performance Computing (HPC) cluster approach to implement K-Means and K-Medoids on a parallel platform

Summary

INTRODUCTION

Data generation is advancing massively and rapidly. Data can now be collected almost anywhere. Data mining techniques make it possible to gather information and process it into knowledge. Clustering is one such technique, and it is very useful for real-world problems [9]. A clustering algorithm can be selected based on the data type or the intended use of the data. The problem is how to process data with thousands of dimensions with high accuracy in the shortest possible time. The same problem was addressed in the research by Jing Zhang, Gongqing Wu, Xuegang Hu, Shiying Li, and Shuilang Hao titled “A Parallel K-means Clustering Algorithm with MPI” [1]. In this research, parallel data clustering using the Message Passing Interface (MPI) was performed to obtain high accuracy and low computational time for clustering results in the data mining process.
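A common way to parallelize K-Means with message passing is to split the data across processes, let each process compute per-cluster sums and counts for its own chunk, and then combine those partial results to update the centroids. The sketch below simulates that pattern in plain Python without an MPI library. The function names, the toy data, and the two simulated processes are assumptions for illustration, and the combining step stands in for an MPI all-reduce. It is not the implementation used in this research.

```python
def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda j: sum((a - b) ** 2 for a, b in zip(point, centroids[j])),
    )


def local_step(chunk, centroids):
    """Work done by one process: per-cluster coordinate sums and counts."""
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for point in chunk:
        j = nearest(point, centroids)
        counts[j] += 1
        for d, value in enumerate(point):
            sums[j][d] += value
    return sums, counts


def reduce_and_update(partials, centroids):
    """Stand-in for an MPI all-reduce: add partial results, then average."""
    new_centroids = []
    for j, old in enumerate(centroids):
        count = sum(counts[j] for _, counts in partials)
        if count == 0:
            new_centroids.append(old)  # keep the centroid of an empty cluster
            continue
        dim = len(old)
        total = [sum(sums[j][d] for sums, _ in partials) for d in range(dim)]
        new_centroids.append(tuple(t / count for t in total))
    return new_centroids


# Hypothetical toy data split across two simulated processes.
data = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0), (2.0, 1.0), (9.0, 9.0)]
chunks = [data[0::2], data[1::2]]
centroids = [(0.0, 0.0), (10.0, 10.0)]
partials = [local_step(chunk, centroids) for chunk in chunks]
updated = reduce_and_update(partials, centroids)
print(updated)
```

Because only the sums and counts are exchanged, the amount of communication per iteration depends on the number of clusters and dimensions rather than on the number of data points, and the combined update matches what a single process would compute on the full dataset.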

K-MEANS CLUSTERING

K-MEDOIDS CLUSTERING

PARALLEL K-MEANS AND K-MEDOIDS CLUSTERING

CLUSTER EVALUATION

PARALLEL PERFORMANCE EVALUATION

DATASET

RESEARCH METHOD

Pre-processing Data

CLUSTERING PERFORMANCE

TIME EVALUATION OF SEQUENTIAL COMPUTATION

Conclusion

Journal: International Journal on Information and Communication Technology (IJoICT)	Publication Date: Jul 25, 2017
Citations: 1	License type: CC BY 4.0
