Aiming at the problem that the traditional algorithm is easy to fall into local optimum in the process of text clustering, which leads to inaccurate text clustering results, a text clustering method based on decision grey wolf optimization K-Means is proposed to cluster the text data set and the standard UCI data set respectively. Afterword segmentation, stop words removal, feature extraction, and text vectorization of text data, the powerful optimization ability of the Decision Gray Wolf Optimization (DGWO) algorithm is used for global optimization, and the clustering center in K-Means algorithm is replaced by the location of wolves. The position of the wolf group is updated by iterative optimization to obtain the optimal clustering center, to perform text clustering. The experimental results show that compared with the traditional method, the precision, recall, and F-Measure of the text data clustering are improved by 49.22%, 51.15%, and 48.98% respectively. The precision, recall, and F-Measure of UCI data clustering are increased by 23.92%, 25.40%, and 24.70% respectively, and the text clustering results are more reliable.
Read full abstract