Multidimensional Discrete Big Data Clustering Algorithm Based on Dynamic Grid

Xiaolei Li

doi:10.1155/2022/4663816

Abstract

Traditionally, the data clustering algorithm is lack of comprehensive performance, leading to low clustering purity and long clustering time. In addition, the consistency between the clustering results and the original data distribution is not strong. Therefore, the multidimensional discrete big data clustering algorithm based on dynamic grid was put forward. Firstly, multidimensional discrete big data was processed in advance. The principal component analysis was used to reduce the dimension of data. The concept of entropy was introduced to divide the key attributes and noncritical attributes, so as to extract the key attributes. According to the results of data preprocessing, the dynamic grid was partitioned. According to the results, OptiGrid in data clustering algorithm was used to achieve the data clustering. The experimental results show that the clustering purity of this algorithm is between 95% and 100%, which is significantly higher than the traditional algorithm. Therefore, the multidimensional discrete big data clustering algorithm based on dynamic grid has better comprehensive performance, closer clustering shape to the original data distribution, higher clustering purity, and faster execution efficiency.

Highlights

Due to the shortcomings in above methods, a multidimensional discrete big data clustering algorithm based on dynamic grid was put forward
The results show that the proposed algorithm is effective in solving their own problems, so it has higher comprehensive performance
In order to verify the effectiveness of multidimensional discrete big data clustering algorithm based on dynamic grid, the clustering shape, efficiency, and accuracy of proposed algorithm was compared with the data clustering methods in Reference [2], Reference [3], and Reference [4] through experiments, and the results analysis was given

Summary

Introduction

With the rapid development of information technology, Internet and cloud computing, the amount of information is increasing explosively. Reference [2] proposed a data clustering method based on K-means algorithm. This method extracted a lot of data samples from massive data. In Reference [3], a data clustering method based on rapid regional evolution was proposed. This method was able to reduce the dimension of data. Due to the shortcomings in above methods, a multidimensional discrete big data clustering algorithm based on dynamic grid was put forward. This algorithm divides the grid in neighborhood of each dimension by the data points, and dynamically adjusts the grid structure. The results show that the proposed algorithm is effective in solving their own problems, so it has higher comprehensive performance

Overall Flow of Multidimensional Discrete Big Data Clustering Algorithm Based on

Multidimensional Discrete Big Data Processing

Dimension Reduction

Attribute Extraction

Dynamic Grid Generation

OptiGrid Data Clustering

Experimental Test Analysis

Experimental Environment and Data Set

Synthetic Data Set

Data Set in Real Environment

Cluster Shape

Cluster Purity

Execution Efficiency

Findings

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Wireless Communications and Mobile Computing	Publication Date: Mar 12, 2022
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multidimensional Discrete Big Data Clustering Algorithm Based on Dynamic Grid

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Wireless Communications and Mobile Computing

Lead the way for us

Similar Papers

Retracted: Multidimensional Discrete Big Data Clustering Algorithm Based on Dynamic Grid
Wireless Communications And Mobile Computing
Wireless Communications and Mobile Computing | VOL. 2023
Wireless Communications And Mobile ComputingWireless Communications And Mobile Computing
28 Jun 2023
Wireless Communications and Mobile Computing | VOL. 2023

Forecasting model of electricity market prosperity index based on multidimensional big data
Jia Liu ... Liang Wang
Journal of Physics: Conference Series | VOL. 1883
Jia Liu, et. al.Jia Liu ... Liang Wang
01 Apr 2021
Journal of Physics: Conference Series | VOL. 1883

An efficient parallel indexing structure for multi-dimensional big data using spark
Manar A. Elmeiligy ... Sally M. Elghamrawy
The Journal of Supercomputing | VOL. 77
Manar A. Elmeiligy, et. al.Manar A. Elmeiligy ... Sally M. Elghamrawy
22 Mar 2021
The Journal of Supercomputing | VOL. 77

Visual Analysis of Multidimensional Big Data: A Scalable Lightweight Bundling Method for Parallel Coordinates
Wenqiang Cui ... Hao Wang
IEEE Transactions on Big Data | VOL. 9
Wenqiang Cui, et. al.Wenqiang Cui ... Hao Wang
01 Feb 2023
IEEE Transactions on Big Data | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multidimensional Discrete Big Data Clustering Algorithm Based on Dynamic Grid

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Wireless Communications and Mobile Computing