Abstract

With the development of Internet and the rapid expansion of electronic resource, text classification technology is becoming an effective organization and management tool to deal with information. In this paper, a method for text categorization based on the fusion of multiple classifiers was presented, reliability function was introduction to select the text that hard to give determine by the main classifier, for these texts, multiple classifiers were used to give the determine which category the unlabeled documents belong to by voting. Experiments showed that the performance of text classification improved by the proposed method. Compared with single classifier, this method achieved better performance, only increasing a small amount of time than using single main classifier. Besides this, this method is more stable than using single classifier for text categorization task, especially when using different corpuses to check the performance of various methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call