Abstract

Due to the wide proliferation of the Internet and telecommunication, huge amount of information has been produced as digital data format. It is impossible to classify this information with one's own hand one by one in many realistic problems, so that the research on automatic text classification has been grown. Machine learning technologies have applied in text classification. However, the traditional statistic machine learning technologies require large number of labeled training examples to learn accurately. To obtain enough training examples, we have to label on these huge training examples by hand. This paper presents a supervised learning algorithm based on support vector machine (SVM) to classify text documents more accurately by using unlabeled documents to augment available labeled training examples. Experimental results indicate that the classification with unlabeled examples using SVM is superior to the conventional classification,with labeled examples.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.