Efficient Fuzzy Similarity-Based Text Classification with SVM and Feature Reduction

Shalini Puri

doi:10.1007/978-981-33-6984-9_28

Abstract

As the volume of text data grows daily, the need for feature reduction in text classification has increased sharply. This paper presents two text classification systems: the concept-based mining model using threshold (CMMT) and the fuzzy similarity-based concept mining model using feature clustering (FSCMM-FC). Both systems classify English text documents into predefined, mutually exclusive categories. They preprocess the documents at the sentence, document, and integrated corpora levels; apply feature extraction and reduction; train the classifier; and finally classify the documents using a support vector machine. CMMT cuts off the less frequent features by applying a threshold to the extracted features, whereas FSCMM-FC reduces the features by finding the feature points using fuzzy C-means. In the experiments, CMMT and FSCMM-FC achieved feature reductions of 95.8% and 94.695%, respectively, and classification accuracies of 85.41% and 93.43%, respectively. These results indicate that FSCMM-FC substantially outperformed CMMT in classification accuracy while still using memory effectively.
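
The threshold-based cut-off described for CMMT can be illustrated with a minimal sketch. The abstract does not specify how features are extracted or which threshold is used, so the whitespace tokenization, the corpus-frequency criterion, the `min_count` parameter, and all function names below are assumptions made purely for illustration, not the author's implementation. The toy corpus is likewise invented.

```python
from collections import Counter


def term_frequencies(documents):
    """Count how often each lowercase token occurs across the whole corpus."""
    counts = Counter()
    for text in documents:
        # Assumption: plain whitespace tokenization stands in for the
        # paper's feature extraction, which the abstract does not detail.
        counts.update(text.lower().split())
    return counts


def cut_off_features(counts, min_count):
    """Keep only the features whose corpus frequency reaches the threshold."""
    return {term for term, n in counts.items() if n >= min_count}


def reduction_percent(n_before, n_after):
    """Share of the original features that were discarded, in percent."""
    return 100.0 * (n_before - n_after) / n_before


# Toy corpus, made up for illustration (not data from the paper).
docs = [
    "svm classifies text documents",
    "fuzzy clustering reduces text features",
    "text features feed the svm",
]

counts = term_frequencies(docs)
kept = cut_off_features(counts, min_count=2)  # hypothetical threshold value

print(sorted(kept))
print(reduction_percent(len(counts), len(kept)))
```

The reduced feature set would then define the vector space used to train the classifier. A percentage computed this way is the kind of figure that a feature-reduction rate refers to, although the paper's exact definition may differ.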

Similar Papers

Feature Reduction for Support Vector Machines
Shouxian Cheng ... Frank Y Shih
01 Jan 2009

Improving text classification with word embedding
Lihao Ge ... Teng-Sheng Moh
01 Dec 2017

Value of feature reduction for crop differentiation using multi-temporal imagery, machine learning, and object-based image analysis
J.K Gilbertson ... A Van Niekerk
01 Jan 2015

An evaluation of text classification methods for literary study
B Yu
Literary and Linguistic Computing | VOL. 23
05 Sep 2008
