Improved k-Means Clustering Algorithm for Big Data Based on Distributed SmartphoneNeural Engine Processor

Fouad H Awad,Murtadha M Hamad

doi:10.3390/electronics11060883

Abstract

Clustering is one of the most significant applications in the big data field. However, using the clustering technique with big data requires an ample amount of processing power and resources due to the complexity and resulting increment in the clustering time. Therefore, many techniques have been implemented to improve the performance of the clustering algorithms, especially for k-means clustering. In this paper, the neural-processor-based k-means clustering technique is proposed to cluster big data by accumulating the advantage of dedicated machine learning processors of mobile devices. The solution was designed to be run with a single-instruction machine processor that exists in the mobile device’s processor. Running the k-means clustering in a distributed scheme run based on mobile machine learning efficiently can handle the big data clustering over the network. The results showed that using a neural engine processor on a mobile smartphone device can maximize the speed of the clustering algorithm, which shows an improvement in the performance of the cluttering up to two-times faster compared with traditional laptop/desktop processors. Furthermore, the number of iterations that are required to obtain (k) clusters was improved up to two-times faster than parallel and distributed k-means.

Highlights

Thousands of clustering algorithms have been published based on this concept, and k-means is one of the most used. k-means is widely used with a wide range of applications due to its simplicity of implementation and its effectiveness
This paper proposes an efficient and high-performance solution to improve the kmeans clustering by: 1. Maximizing the performance of the k-means algorithm by running it on the dedicated neural engine processor of smart mobile devices by editing the code and steps of the kmeans algorithm to run on the single-instruction-based machine with an ARM-based processor; 2
4 9 12 22 In Table 5, several iterations are fixed in the case of the parallel neural k-means algorithm using the education sector dataset, i.e., for k = 4, 5, 6, 7, but this kept changing from one run to another in the case of the parallel k-means clustering algorithm with multiple running times

Summary

Introduction

We are in a data flood era, as proven by the massive amounts of continuously generated data at unprecedented and ever-increasing scales. Machine learning techniques have become increasingly popular in a wide range of large and complex data-intensive applications, such as astronomy, as well as medicine, biology, and other sciences [1]. These strategies offer potential options for extracting hidden information from the data.

Big Data Clustering

Multi-Machine Clustering

Big Data Platform

Related Work

Proposed Solution

Proposed Solution Processing

Complexity

Analysis of the Experiment Results

Neural Engine Performance

Number of Iterations

Results

Multiple Cores and Multiple Processors

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Mar 11, 2022
Citations: 25	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Improved k-Means Clustering Algorithm for Big Data Based on Distributed SmartphoneNeural Engine Processor

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Chapter 27 - D-PPSOK clustering algorithm with data sampling for clustering big data analysis
C Suresh Gnana Dhas ... Tadele Degefa Geleto
System Assurances | VOL. -
C Suresh Gnana Dhas, et. al.C Suresh Gnana Dhas ... Tadele Degefa Geleto
01 Jan 2021
System Assurances | VOL. -

A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapReduce Capability
Kamlesh Kumar Pandey ... Ram Milan
-
Kamlesh Kumar Pandey, et. al.Kamlesh Kumar Pandey ... Ram Milan
01 Jan 2020
01 Jan 2020

A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis
Adil Fahad ... Najlaa Alshatri
IEEE Transactions on Emerging Topics in Computing | VOL. 2
Adil Fahad, et. al.Adil Fahad ... Najlaa Alshatri
01 Sep 2014
IEEE Transactions on Emerging Topics in Computing | VOL. 2

Application of Parallel Clustering Algorithms for Big Data in the Division of Stock
...
Big Data Research | VOL. 1
, et. al. ...
22 Dec 2015
Big Data Research | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved k-Means Clustering Algorithm for Big Data Based on Distributed SmartphoneNeural Engine Processor

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics