Research and application of improved K-means based on MapReduce

Hongqin Wang,Li Jiang,Zhengjun Pan,Hongxia Wang

doi:10.1088/1742-6596/1651/1/012074

Research and application of improved K-means based on MapReduce

Hongqin Wang, Li Jiang + Show 2 more

Open Access

https://doi.org/10.1088/1742-6596/1651/1/012074

Copy DOI

Journal: Journal of Physics: Conference Series	Publication Date: Nov 1, 2020
License type: cc-by

Affiliation: Institute of Software, Henan Institute of Technology

#Improved K-Means Algorithm #Big Data + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

With the development of big data, the traditional data mining clustering algorithm K-Means is inefficient and has poor scalability in dealing with massive data. MapReduce on the Hadoop platform was used to realize the parallel processing of the K-Means algorithm, the performance of the algorithm was tested by experiments. The results show that the improved K-Means algorithm has good parallel expansion capability, high efficiency, and great potential when processing big data mining. The algorithm is applied to the big data processing of customer consumption in a restaurant chain, and the effectiveness of the algorithm is verified, which can better serve the decision of restaurant.

Full Text