Improving Text Classification by Web Corpora

Rafael Guzmán,Paolo Rosso,Manuel Montes,Luis Villaseñor

doi:10.1007/978-3-540-72575-6_25

Improving Text Classification by Web Corpora

Rafael Guzmán, Paolo Rosso + Show 2 more

Open Access

https://doi.org/10.1007/978-3-540-72575-6_25

Copy DOI

Publication Date: Jan 1, 2007

Citations: 13

Affiliation: Universitat Politècnica de València, Universidad de Guanajuato, National Institute of Astrophysics, Optics and Electronics

#Unlabeled Examples #Improving Text Classification + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

A major difficulty of supervised approaches for text classification is that they require a great number of training instances in order to construct an accurate classifier. This paper proposes a semi-supervised method that is specially suited to work with very few training examples. It considers the automatic extraction of unlabeled examples from the Web as well as an iterative integration of unlabeled examples into the training process. Preliminary results indicate that our proposal can significantly improve the classification accuracy in scenarios where there are less than ten training examples available per class.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.