Abstract

In this paper, we focus on feature coverage policies used for feature selection in the text classification domain. Two alternative policies are discussed and compared: corpus-based and class-based selection of features. We analyze pruning and keyword selection in detail by varying the parameters of each policy and identify their optimal usage patterns. In addition, by combining the optimal forms of these methods, we propose a novel two-stage feature selection approach. Experiments on three independent datasets show that the proposed method yields a statistically significant improvement in classifier success rates over the traditional methods.
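
The abstract does not give implementation details, but the following minimal Python sketch illustrates the general two-stage idea it describes: a corpus-based pruning pass followed by class-based keyword selection over the pruned vocabulary. The function names, the document-frequency threshold, and the per-class relative-frequency score used here are illustrative assumptions, not the paper's actual policies or metrics.

```python
from collections import Counter, defaultdict

def corpus_pruning(docs, min_df=2):
    """Corpus-based policy (assumed form): drop terms that occur in
    fewer than min_df documents across the whole corpus."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return {t for t, c in df.items() if c >= min_df}

def class_keyword_selection(docs, labels, vocab, k_per_class=100):
    """Class-based policy (assumed form): keep the top-k terms per class,
    ranked by a simple class-conditional relative-frequency score.
    This score is a placeholder; the abstract does not name the metric."""
    class_tf = defaultdict(Counter)
    total_tf = Counter()
    for tokens, y in zip(docs, labels):
        kept = [t for t in tokens if t in vocab]
        class_tf[y].update(kept)
        total_tf.update(kept)
    selected = set()
    for y, tf in class_tf.items():
        ranked = sorted(vocab, key=lambda t: tf[t] / (total_tf[t] + 1e-9), reverse=True)
        selected.update(ranked[:k_per_class])
    return selected

def two_stage_selection(docs, labels, min_df=2, k_per_class=100):
    """Two-stage combination: corpus-based pruning first, then
    class-based keyword selection on the surviving terms."""
    pruned_vocab = corpus_pruning(docs, min_df=min_df)
    return class_keyword_selection(docs, labels, pruned_vocab, k_per_class=k_per_class)

if __name__ == "__main__":
    # Toy usage example with hypothetical tokenized documents and labels.
    docs = [["price", "stock", "market"], ["goal", "match", "stock"],
            ["market", "shares", "price"], ["match", "team", "goal"]]
    labels = ["finance", "sports", "finance", "sports"]
    print(two_stage_selection(docs, labels, min_df=2, k_per_class=2))
```

The resulting term set would then define the feature space given to the classifier; the actual thresholds and ranking metrics would have to be taken from the full text of the paper.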
