Abstract

The traditional SOM algorithm requires the number of clustering categories to be determined in advance, a choice that is highly subjective. This paper proposes an improved k-means initial value selection algorithm that calculates the number of clustering categories and applies the result to the SOM network model. Latent Semantic Indexing is applied in the pre-processing stage of clustering, and the improved SOM algorithm is applied in the text clustering stage. Specifically, the number of clustering categories obtained by the improved k-means initial value selection algorithm is used as the number of neurons in the output layer of the SOM network.
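
The last step described above, sizing the SOM output layer from a precomputed category count, can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not detail the improved k-means initial value selection algorithm, so the category count `k` is simply assumed to be given, and the input vectors are assumed to be document vectors already reduced by Latent Semantic Indexing. The function names (`train_som`, `best_matching_unit`, `assign_clusters`), the 1-D neuron layout, and the linear decay schedules are all illustrative assumptions.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the index of the neuron whose weight vector is closest to x."""
    return min(
        range(len(weights)),
        key=lambda j: sum((w - xi) ** 2 for w, xi in zip(weights[j], x)),
    )


def train_som(vectors, k, epochs=50, lr0=0.5, seed=0):
    """Train a 1-D SOM whose output layer has exactly k neurons.

    `k` is assumed to come from a separate category-count estimate
    (hypothetical here; the abstract does not specify the procedure).
    """
    rng = random.Random(seed)
    dim = len(vectors[0])
    # Initialise each neuron's weights from a randomly chosen input vector.
    weights = [list(rng.choice(vectors)) for _ in range(k)]
    sigma0 = max(k / 2.0, 1.0)  # initial neighbourhood radius
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
        sigma = sigma0 * (1.0 - frac) + 1e-3     # linearly shrinking radius
        for x in rng.sample(vectors, len(vectors)):  # shuffled pass over inputs
            bmu = best_matching_unit(weights, x)
            for j in range(k):
                # Gaussian neighbourhood on the 1-D neuron index.
                h = math.exp(-((j - bmu) ** 2) / (2.0 * sigma ** 2))
                for d in range(dim):
                    weights[j][d] += lr * h * (x[d] - weights[j][d])
    return weights


def assign_clusters(weights, vectors):
    """Label each vector with the index of its best-matching neuron."""
    return [best_matching_unit(weights, x) for x in vectors]
```

A call such as `assign_clusters(train_som(docs, k=3), docs)` would return one label in `range(3)` per document, so the number of possible clusters is fixed by `k` rather than chosen by hand.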
