Abstract
Natural Language Processing (NLP) is a science that integrates computer knowledge, mathematical knowledge and linguistic knowledge, and text classification and recognition is considered to be an important research field and direction of natural language processing. This paper mainly studies the realization of text classification model through the processing method of text data in natural language processing and the theoretical knowledge and technical means of machine learning. We summarize the existing text classification algorithms, analyze their applicable scenarios, and optimize on the basis of these algorithm models. The paper proposes a Chinese text classification algorithm based on weight preprocessing. Algorithm based on the weight preprocessing link, so that the optimized classifier model can improve the accuracy of the existing text classifier. In this paper, the English Newsgroups corpus is used for experimental verification. The experimental results show that the classification accuracy and accuracy of the improved algorithm are better than those of the traditional text classification algorithm, thereby improving the accuracy of English text classification.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have