Incremental support vector machine for unlabeled data classification

Jinhyuk Hong Jinhyuk Hong,Sung-Bae Cho Sung-Bae Cho

doi:10.1109/iconip.2002.1202851

Abstract

Due to the wide proliferation of the Internet and telecommunication, huge amount of information has been produced as digital data format. It is impossible to classify this information with one's own hand one by one in many realistic problems, so that the research on automatic text classification has been grown. Machine learning technologies have applied in text classification. However, the traditional statistic machine learning technologies require large number of labeled training examples to learn accurately. To obtain enough training examples, we have to label on these huge training examples by hand. This paper presents a supervised learning algorithm based on support vector machine (SVM) to classify text documents more accurately by using unlabeled documents to augment available labeled training examples. Experimental results indicate that the classification with unlabeled examples using SVM is superior to the conventional classification,with labeled examples.

Full Text