Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm

Qiang Gao, Jing Feng

doi:10.1109/hpcc/smartcity/dss.2019.00171

Abstract

The K-means algorithm is one of the most frequently used clustering algorithms in hot topic discovery. However, it requires the number of clusters K to be specified in advance and easily falls into a local optimum, so its clustering accuracy is limited, which directly affects the quality of hotspot discovery. This paper proposes an improved K-means algorithm for fast clustering of microblog texts. A single-pass clustering step that combines the high-frequency words and the similarities of the microblog texts yields the number of clusters K and the initial cluster centers, which addresses the sensitivity of K-means to the K value and to the initial centers. Experimental comparison and analysis show that the improved algorithm compensates for these shortcomings of K-means and effectively improves the efficiency and accuracy of clustering. The improved algorithm is then applied to a hot topic discovery model, and experiments verify that the resulting hotspot discovery model is effective and achieves high accuracy.
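
The idea of seeding K-means from a single-pass clustering step can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the similarity measure, the threshold, or how high-frequency words enter the computation, so the cosine similarity, the term-frequency vectors over the most frequent words, and the names vectorize, single_pass, kmeans, vocab_size and threshold are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def vectorize(docs, vocab_size):
    """Term-frequency vectors over the vocab_size most frequent words (assumed feature choice)."""
    counts = Counter(w for d in docs for w in d.split())
    vocab = [w for w, _ in counts.most_common(vocab_size)]
    return [[d.split().count(w) for w in vocab] for d in docs]


def single_pass(vectors, threshold):
    """One pass over the data: join the most similar cluster or open a new one.

    Returns the cluster centers; their count plays the role of K.
    """
    members, centers = [], []
    for v in vectors:
        sims = [cosine(v, c) for c in centers]
        best = max(range(len(sims)), key=sims.__getitem__, default=None)
        if best is not None and sims[best] >= threshold:
            members[best].append(v)
            centers[best] = mean(members[best])
        else:
            members.append([v])
            centers.append(list(v))
    return centers


def kmeans(vectors, init_centers, max_iter=20):
    """Standard assign/update iterations, started from the given centers."""
    centers = [list(c) for c in init_centers]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            max(range(len(centers)), key=lambda k: cosine(v, centers[k]))
            for v in vectors
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        for k in range(len(centers)):
            assigned = [v for v, lab in zip(vectors, labels) if lab == k]
            if assigned:  # keep the old center if a cluster ends up empty
                centers[k] = mean(assigned)
    return labels, centers


# Toy usage with made-up posts; threshold 0.5 is an arbitrary illustrative value.
docs = ["rain storm flood", "storm flood warning", "match goal team", "goal team win"]
vectors = vectorize(docs, vocab_size=8)
seeds = single_pass(vectors, threshold=0.5)
labels, centers = kmeans(vectors, seeds)
print(len(seeds), labels)
```

In this sketch the single pass fixes both K (the number of seeds) and the starting centers, so K-means no longer depends on a user-chosen K or a random initialization. The result does still depend on the threshold and on the order in which posts are visited.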
