Feature selection based on term frequency deviation rate for text classification

Hongfang Zhou,Xiang Li,Yiming Ma

doi:10.1007/s10489-020-01937-4

Journal: Applied Intelligence (Dordrecht, Netherlands)
Publication Date: Nov 11, 2020
Citations: 10

Affiliation: Xi'an University of Technology

#Support Vector Machine #Classifiers Of Support Vector Machine

Abstract

Feature selection is a technique for selecting a subset of the most relevant features for model training. This paper first proposes a new concept, the term frequency deviation rate (TDR), to improve classification accuracy. It then presents a TDR-based feature selection algorithm for text classification. Finally, extensive experiments are conducted on seven datasets (K1a, K1b, WAP, R52, R8, 20Newsgroups, and Cade12) with two classifiers, Naive Bayes and Support Vector Machine. The experimental results indicate that the new approach improves classification accuracy by 7.9% on average.

Full Text