Comparison on Feature Selection Methods for Text Classification

Wenkai Liu,Jiongen Xiao,Ming Hong

doi:10.1145/3380625.3380677

Abstract

The high-dimensional text data always contains a large quantity of noisy terms which bring negative effects on the performance of text classification. Feature selection is the common solution for dimension reduction in text classification. The choices of feature selection methods for text classification have significant impacts on classification accuracy. According to our literature review, few recent studies of feature selection focus on performance comparisons on feature selection methods. To fill this gap, this paper conducts discussions to compare performances of typical feature selection methods which are commonly involved in previous studies for text classification. Firstly, we introduce and discuss a series of typical feature selection methods in previous studies for text classification in details. Secondly, we conduct comparison experiments on four benchmark datasets to compare the effectiveness of twenty typical feature selection methods in text classification. Finally, we give conclusions on performance of the typical feature selection methods. The result of this paper gives a guideline for selecting appropriate feature selection methods for text classification academic analysis or real-world text classification applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison on Feature Selection Methods for Text Classification

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Research on Feature Selection and kNN Classification Method in Chinese Text Classification
Chao Xiao ... Ping Wu
-
Chao Xiao, et. al.Chao Xiao ... Ping Wu
01 Jan 2015
01 Jan 2015

Migrating birds optimization-based feature selection for text classification
Cem Kaya ... Zeynep Hilal Kilimci
PeerJ Computer Science | VOL. 10
Cem Kaya, et. al.Cem Kaya ... Zeynep Hilal Kilimci
30 Aug 2024
PeerJ Computer Science | VOL. 10

An evaluation of text classification methods for literary study
B Yu
Literary and Linguistic Computing | VOL. 23
B YuB Yu
05 Sep 2008
Literary and Linguistic Computing | VOL. 23

Utility-based feature selection for text classification
Heyong Wang ... Raymond Yiu Keung Lau
Knowledge and Information Systems | VOL. 61
Heyong Wang, et. al.Heyong Wang ... Raymond Yiu Keung Lau
08 Dec 2018
Knowledge and Information Systems | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison on Feature Selection Methods for Text Classification

Abstract

Talk to us

Similar Papers