Abstract

There are relevance and redundancy of the feature words in the text vector space, so we proposed a text-reducing method based on the improved KNN algorithm in this paper. Vector polymer theory and feature selection methods were used to reducing the dimension of vector space. Feature words would have more ability to represent categories after feature selection. Experiments proved, the improved KNN algorithm were used in text-reducing not only can reducing the dimension of vector space more effectively, but also can improving the speed and accuracy of the text classify.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call