Chinese text classification for small sample set

Lei Li,Yu-Guang Huang,Zhong-Wan Liu

doi:10.1016/s1005-8885(10)60205-1

Abstract

Text classification is one of the most important topics in the fields of Internet information management and natural language processing. Machine learning based text classification methods are currently most popular ones with better performance than rule based ones. But they always need lots of training samples, which not only brings heavy work for previous manual classification, but also puts forward a higher request for storage and computing resources during the computer post-processing. Naïve Bayes algorithm is one of the most effective methods for text classification with the same problem. Only in the large training sample set can it get a more accurate result. This paper mainly studies Naïve Bayes classification algorithm for Chinese text based on Poisson distribution model and feature selection. The experimental results have shown that this method keeps high classification accuracy even in a small sample set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Chinese text classification for small sample set

Abstract

Talk to us

Similar Papers

More From: The Journal of China Universities of Posts and Telecommunications

Lead the way for us

Journal: The Journal of China Universities of Posts and Telecommunications	Publication Date: Sep 1, 2011
Citations: 3

Similar Papers

Naive Bayes classification algorithm based on small sample set
Yuguang Huang ... Lei Li
-
Yuguang Huang, et. al.Yuguang Huang ... Lei Li
01 Sep 2011
01 Sep 2011

Research On Text Classification Based On Deep Neural Network
Deageon Kim
International Journal of Communication Networks and Information Security (IJCNIS) | VOL. 14
Deageon KimDeageon Kim
31 Dec 2022
International Journal of Communication Networks and Information Security (IJCNIS) | VOL. 14

Research on text classification technology based on natural language processing
Dandan Song
-
Dandan SongDandan Song
20 Oct 2022
20 Oct 2022

Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification.
Meikang Chen ... Kurban Ubul
Sensors (Basel, Switzerland) | VOL. 22
Meikang Chen, et. al.Meikang Chen ... Kurban Ubul
28 Feb 2022
Sensors (Basel, Switzerland) | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Chinese text classification for small sample set

Abstract

Talk to us

Similar Papers

More From: The Journal of China Universities of Posts and Telecommunications