Abstract

Each document in a multi-label classification task is associated with a subset of labels. These documents usually contain a large number of features, which can hamper the performance of learning algorithms. Feature selection is therefore helpful in removing the redundant and irrelevant features that hold performance back. The current study proposes a Naive Bayes (NB) multi-label classification algorithm that incorporates a wrapper-based feature selection strategy aimed at determining the best minimum confidence threshold. This paper also suggests transforming the multi-label documents before applying a standard feature selection algorithm: each document is copied once for every label it belongs to, with each copy retaining all of the document's features. Seven minimum confidence thresholds were then evaluated, with Class Association Rules (CARs) serving as the wrapper approach. Experiments on benchmark datasets showed that the Naive Bayes Multi-label (NBML) classifier achieved an average precision of 87.9% on the business dataset when using a minimum confidence threshold of 0.1%.
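The copy transformation described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `copy_transform` and the tuple-based document representation are assumptions made for clarity. Each multi-label document is duplicated once per assigned label, and every copy keeps the full feature set, so a standard single-label feature selection algorithm can then be applied.

```python
def copy_transform(documents):
    """Copy each multi-label document into one single-label copy per label.

    `documents` is a list of (features, labels) pairs, where `features`
    is any feature representation (e.g. a term-frequency dict) and
    `labels` is the set of labels assigned to the document.
    """
    single_label = []
    for features, labels in documents:
        for label in labels:
            # Every copy retains all features of the original document.
            single_label.append((features, label))
    return single_label


# Example: a document with two labels yields two single-label copies.
docs = [({"word_a": 2, "word_b": 1}, {"business", "sports"})]
print(copy_transform(docs))
```

Under this sketch, a corpus of n documents with an average of k labels each expands to roughly n*k single-label instances before feature selection is run.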
