Performance Enhancement of the Unbalanced Text Classification Problem Through a Modified Chi Square-Based Feature Selection Technique

Santosh Kumar Behera,Rajashree Dash

doi:10.4018/ijiit.309581

Abstract

This paper proposes a modified chi square-based feature selection algorithm in conjunction with a random vector functional link network-based text classifier for improving the classification performance of multi-labeled text documents with unbalanced class distributions. In the proposed feature selection method, maximum features are selected from classes that have a great deal of training and testing documents as an improvement towards original chi-square method. On two benchmark datasets that are multi-labeled, multi-class, and unbalanced, a comparison of the model with three conventional selection techniques such as chi-square, term frequency-inverse document frequency, and mutual information is accumulated for assessing its effectiveness. Additionally, the proposed model is compared with four different classifiers. In the study, it was found that the proposed model performs better in terms of precision, recall, f-measure, and hamming losses and is able to select the majority of true positive documents despite an unbalanced class distribution for both the datasets.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Performance Enhancement of the Unbalanced Text Classification Problem Through a Modified Chi Square-Based Feature Selection Technique

Abstract

Published Version

Talk to us

Similar Papers

More From: International Journal of Intelligent Information Technologies

Lead the way for us

Journal: International Journal of Intelligent Information Technologies	Publication Date: Sep 23, 2022
License type: CC BY 3.0

Similar Papers

Method on Alzheimer’s Disease Classification Utilizing Fuzzy Logic Feature Selection and Heterogeneous Ensemble Learning
...
-
, et. al. ...
01 Jan 2020
01 Jan 2020

Feature Selection for Text Classification Using Machine Learning Approaches
K Thirumoorthy ... K Muneeswaran
National Academy Science Letters | VOL. 45
K Thirumoorthy, et. al.K Thirumoorthy ... K Muneeswaran
24 Mar 2021
National Academy Science Letters | VOL. 45

A Filter Approach to Feature Selection Based on Survival Cauchy-Schwartz Mutual Information
Su Xiangchenyang ... Liu Fang
-
Su Xiangchenyang, et. al.Su Xiangchenyang ... Liu Fang
01 Jun 2018
01 Jun 2018

A novel feature selection framework for automatic web page classification
J Alamelu Mangai ... S Appavu Alias Balamurugan
International Journal of Automation and Computing | VOL. 9
J Alamelu Mangai, et. al.J Alamelu Mangai ... S Appavu Alias Balamurugan
01 Aug 2012
International Journal of Automation and Computing | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Performance Enhancement of the Unbalanced Text Classification Problem Through a Modified Chi Square-Based Feature Selection Technique

Abstract

Published Version

Talk to us

Similar Papers

More From: International Journal of Intelligent Information Technologies