Large dataset partitioning using ensemble partition-based clustering with majority voting technique

Vunnava Dinesh Babu, Karunakaran Malathi

doi:10.11591/ijeecs.v29.i2.pp838-844

Open Access

https://doi.org/10.11591/ijeecs.v29.i2.pp838-844

Abstract

Large datasets have become important in data mining, where vast amounts of data must be processed, stored, and handled. However, handling and processing large datasets is time-consuming and memory-intensive. Researchers have therefore adopted partitioning strategies to improve controllability and performance and to reduce the time and memory that large datasets require. Unfortunately, the many clustering techniques available in the literature can make it hard for practitioners to choose the best one for a given dataset. Furthermore, no single clustering technique handles every problem, such as cluster structure, noise, or density, and existing clustering techniques need scalable solutions to manage large datasets. This paper therefore proposes an ensemble partition-based clustering method with majority voting for large dataset partitioning, which aggregates k-means, k-medoids, fuzzy c-means, expectation-maximization (EM), and density-based spatial clustering of applications with noise (DBSCAN). In the first stage, each technique clusters the large dataset individually. In the second stage, the final clusters are determined by majority voting among the five clustering algorithms: each data instance is assigned to the cluster that receives the most votes. The experimental findings show that the ensemble partition-based clustering method outperforms the five individual clustering algorithms in execution time and accuracy.
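The second-stage vote can be illustrated with a minimal sketch. It assumes the five base clusterings have already been computed and are given as lists of integer labels. The abstract does not say how cluster labels from different algorithms are matched before voting, so the greedy overlap alignment below is an assumption, as are the function names `align_to_reference` and `majority_vote` and the use of `-1` for unassigned (noise) points.

```python
from collections import Counter

NOISE = -1  # assumed marker for points a base clusterer leaves unassigned


def align_to_reference(reference, labels):
    """Relabel `labels` so each cluster takes the reference label it overlaps most.

    Assumption: greedy overlap matching; the source does not specify an alignment.
    """
    overlap = {}
    for ref, lab in zip(reference, labels):
        if ref == NOISE or lab == NOISE:
            continue  # noise points do not contribute to the matching
        overlap.setdefault(lab, Counter())[ref] += 1
    mapping = {lab: counts.most_common(1)[0][0] for lab, counts in overlap.items()}
    return [mapping.get(lab, NOISE) for lab in labels]


def majority_vote(labelings):
    """Combine label lists (one per base clusterer) by per-instance majority vote."""
    reference = labelings[0]
    aligned = [list(reference)]
    aligned += [align_to_reference(reference, labels) for labels in labelings[1:]]
    final = []
    for votes in zip(*aligned):
        counted = Counter(v for v in votes if v != NOISE)  # noise votes abstain
        final.append(counted.most_common(1)[0][0] if counted else NOISE)
    return final


# Hypothetical label lists standing in for the five base clusterers.
labelings = [
    [0, 0, 0, 1, 1, 1],   # reference partition
    [1, 1, 1, 0, 0, 0],   # same partition with swapped label ids
    [0, 0, 1, 1, 1, 1],   # disagrees on the third instance
    [2, 2, 2, 5, 5, 5],   # same partition with different label ids
    [0, 0, 0, -1, 1, 1],  # marks the fourth instance as noise
]
print(majority_vote(labelings))  # → [0, 0, 0, 1, 1, 1]
```

Ties are broken in favor of the label seen first among the votes, which is a simplification. A one-to-one matching (for example, the Hungarian method on the contingency table) would be a stricter alternative to the greedy alignment used here.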

Journal: Indonesian Journal of Electrical Engineering and Computer Science
Publication Date: Feb 1, 2023
License type: CC BY-NC 4.0

Similar Papers

Implementation of Density-Based Spatial Clustering of Applications with Noise and Fuzzy C – Means for Clustering Car Sales
Sephia Nazwa Auliani
The Indonesian Journal of Computer Science | VOL. 13
25 Jul 2024

Towards Clustering of Mobile and Smartwatch Accelerometer Data for Physical Activity Recognition
Chelsea Dobbins ... Reza Rawassizadeh
Informatics | VOL. 5
12 Jun 2018

AMF-IDBSCAN: Incremental Density Based Clustering Algorithm Using Adaptive Median Filtering Technique
Aida Chefrour ... Labiba Souici-Meslati
Informatica | VOL. 43
15 Dec 2019

Clustering Techniques in Bioinformatics
Muhammad Ali Masood ... M N A Khan
International Journal of Modern Education and Computer Science | VOL. 7
08 Jan 2015
