An Examination of Feature Selection Frameworks in Text Categorization

Bong Chih How, Wong Ting Kiong

doi:10.1007/11562382_50

Abstract

Feature selection is an important task in text categorization, where it is used for dimensionality reduction. It can be performed locally or globally. In local selection, a distinct feature set is derived for each class, so the number of feature sets depends on the number of classes. In contrast, global feature selection uses only one universal feature set, which is assumed to preserve the characteristics of all classes. Furthermore, feature selection can be carried out on the relevant feature set only (local dictionary) or on both the relevant and irrelevant feature sets (universal dictionary). In this paper, we explore the different frameworks of feature selection for the task of text categorization on the Reuters(10) and Reuters(115) datasets (variants of the Reuters-21578 corpus). We then investigate the efficiency of seven different local and global feature selection methods in combination with local and universal dictionaries. Our experiments show that local feature selection with a local dictionary yields optimal categorization results.
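
The distinction between the frameworks described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the toy corpus, the function names, the choice of the chi-square statistic as the scoring function, and the use of the maximum score over classes for global selection are all assumptions made for illustration. The abstract does not name the seven selection methods that were evaluated.

```python
# Toy corpus of (tokens, class label) pairs. Purely illustrative, not from the paper.
DOCS = [
    (["wheat", "crop", "export"], "grain"),
    (["wheat", "harvest", "price"], "grain"),
    (["oil", "barrel", "price"], "crude"),
    (["oil", "export", "opec"], "crude"),
]


def chi_square(term, cls, docs):
    """Chi-square statistic of the 2x2 term/class contingency table (assumed scorer)."""
    a = sum(1 for toks, label in docs if term in toks and label == cls)
    b = sum(1 for toks, label in docs if term in toks and label != cls)
    c = sum(1 for toks, label in docs if term not in toks and label == cls)
    d = sum(1 for toks, label in docs if term not in toks and label != cls)
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0


def vocabulary(docs, cls=None):
    """Local dictionary (terms seen in documents of cls) or, if cls is None, universal."""
    return {t for toks, label in docs if cls is None or label == cls for t in toks}


def local_selection(docs, k, local_dictionary=True):
    """Local selection: one feature set of size k per class."""
    selected = {}
    for cls in sorted({label for _, label in docs}):
        candidates = vocabulary(docs, cls if local_dictionary else None)
        # Sort by descending score, breaking ties alphabetically for determinism.
        ranked = sorted(candidates, key=lambda t: (-chi_square(t, cls, docs), t))
        selected[cls] = ranked[:k]
    return selected


def global_selection(docs, k):
    """Global selection: a single feature set shared by all classes.

    Per-class scores are combined with max(), which is an assumption of this sketch.
    """
    classes = sorted({label for _, label in docs})
    candidates = vocabulary(docs)
    score = {t: max(chi_square(t, c, docs) for c in classes) for t in candidates}
    return sorted(candidates, key=lambda t: (-score[t], t))[:k]


if __name__ == "__main__":
    print(local_selection(DOCS, 2, local_dictionary=True))
    print(local_selection(DOCS, 2, local_dictionary=False))
    print(global_selection(DOCS, 2))
```

On this toy corpus, the local dictionary restricts the candidates for "grain" to terms that occur in grain documents, whereas the universal dictionary also admits "oil", which scores highly for "grain" only because its absence is informative.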
