Impact of Feature Selection for Data Classification Using Naive Bayes Classifier

Eman Hato

doi:10.1088/1742-6596/1879/2/022088

Abstract

In data processing and analysis, a dataset may contain a large number of features that restrict its usability and applicability, so its dimensionality needs to be reduced. Feature selection is the process of removing as many redundant and irrelevant features as possible from the original dataset to improve the efficiency of the mining process. This paper presents a study that evaluates and compares filter and wrapper methods as feature selection approaches in terms of classification accuracy and time complexity. The Naive Bayes classifier and three classification datasets from the UCI repository are used in the classification procedure: the Iris, Ionosphere, and Ovarian Cancer datasets. To investigate the effect of the feature selection methods, they are applied to these datasets, which have different characteristics, to obtain the selected feature vectors, which are then classified according to each dataset's categories. Experimental results indicate that the filter and wrapper methods provide approximately equal classification accuracy: the average accuracy for the Ionosphere and Ovarian Cancer datasets is 0.78 and 0.91, respectively, for the same selected feature vectors. For the Iris dataset, the filter method outperforms the wrapper method by achieving the same accuracy with only half the number of selected features. The results also show that the filter method is superior in terms of execution time.
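The comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not name the specific filter or wrapper algorithms, so the code below makes several assumptions: a Fisher-style score stands in for the filter method, greedy forward selection stands in for the wrapper method, and a small synthetic dataset (hypothetical, not one of the UCI datasets) replaces the real data. All function names are illustrative only.

```python
import math
import random
import time
from statistics import mean, pvariance

def make_data(n_per_class=60, seed=0):
    """Hypothetical toy data: features 0 and 1 are informative, 2 and 3 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for label, centre in ((0, 0.0), (1, 2.0)):
        for _ in range(n_per_class):
            X.append([rng.gauss(centre, 1.0), rng.gauss(centre, 1.0),
                      rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)])
            y.append(label)
    return X, y

def nb_fit(X, y, cols):
    """Gaussian Naive Bayes: per-class log prior plus per-feature mean and variance."""
    model = {}
    for c in set(y):
        rows = [x for x, t in zip(X, y) if t == c]
        stats = [(mean(r[j] for r in rows),
                  pvariance([r[j] for r in rows]) + 1e-9) for j in cols]
        model[c] = (math.log(len(rows) / len(X)), stats)
    return model

def nb_predict(model, x, cols):
    """Return the class with the highest log posterior for sample x."""
    def log_post(c):
        prior, stats = model[c]
        return prior + sum(-0.5 * math.log(2 * math.pi * v) - (x[j] - m) ** 2 / (2 * v)
                           for j, (m, v) in zip(cols, stats))
    return max(model, key=log_post)

def cv_accuracy(X, y, cols, folds=5):
    """k-fold cross-validated accuracy of Naive Bayes restricted to `cols`."""
    idx = list(range(len(X)))
    random.Random(1).shuffle(idx)
    correct = 0
    for f in range(folds):
        test = idx[f::folds]
        held_out = set(test)
        train = [i for i in idx if i not in held_out]
        model = nb_fit([X[i] for i in train], [y[i] for i in train], cols)
        correct += sum(nb_predict(model, X[i], cols) == y[i] for i in test)
    return correct / len(X)

def filter_select(X, y, k):
    """Filter (assumed variant): rank features by a Fisher-style score, no classifier involved."""
    def score(j):
        groups = [[x[j] for x, t in zip(X, y) if t == c] for c in sorted(set(y))]
        overall = mean(x[j] for x in X)
        between = sum(len(g) * (mean(g) - overall) ** 2 for g in groups)
        within = sum(len(g) * pvariance(g) for g in groups)
        return between / (within + 1e-12)
    return sorted(sorted(range(len(X[0])), key=score, reverse=True)[:k])

def wrapper_select(X, y, k):
    """Wrapper (assumed variant): greedy forward selection scored by classifier CV accuracy."""
    chosen = []
    while len(chosen) < k:
        rest = [j for j in range(len(X[0])) if j not in chosen]
        best = max(rest, key=lambda j: cv_accuracy(X, y, chosen + [j]))
        chosen.append(best)
    return sorted(chosen)

X, y = make_data()
for name, select in (("filter", filter_select), ("wrapper", wrapper_select)):
    start = time.perf_counter()
    cols = select(X, y, 2)
    elapsed = time.perf_counter() - start
    print(name, cols, round(cv_accuracy(X, y, cols), 3), f"{elapsed:.4f}s")
```

The wrapper retrains and re-evaluates the classifier for every candidate feature at every step, while the filter computes one score per feature. That difference in work is one plausible reason a filter method would run faster, in line with the execution-time finding reported above.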

Journal: Journal of Physics: Conference Series
Publication Date: May 1, 2021
Citations: 4
License type: CC BY

Similar Papers

A Review of Feature Selection Techniques in Sentiment Analysis Using Filter, Wrapper, or Hybrid Methods
Pulung Hendro Prastyo ... Igi Ardiyanto
07 Sep 2020

Upper-Limb Motion Recognition Based on Hybrid Feature Selection: Algorithm Development and Validation.
Qiaoqin Li ... Yongguo Liu
JMIR mHealth and uHealth | VOL. 9
02 Sep 2021

A novel feature selection approach for biomedical data classification
Yonghong Peng ... Jianmin Jiang
Journal of Biomedical Informatics | VOL. 43
30 Jul 2009

Hybrid filter–wrapper feature selection for short-term load forecasting
Zhongyi Hu ... Raymond Chiong
Engineering Applications of Artificial Intelligence | VOL. 40
27 Jan 2015
