EF_Unique: An Improved Version of Unsupervised Equal Frequency Discretization Method

Mehmet Hacibeyoglu, Mohammed H Ibrahim

doi:10.1007/s13369-018-3144-z

Abstract

Discretization is an important data preprocessing technique used in data mining and knowledge discovery processes. Its purpose is to transform or partition continuous values into discrete ones. Many data mining classification algorithms can then be applied to the discrete data, which is more concise and meaningful than the continuous data, resulting in better performance. In this study, an improved version of the unsupervised equal frequency (EF) discretization method, EF_Unique, is proposed to enhance discretization performance. The proposed EF_Unique method is based on the unique values of the attribute to be discretized. To test the proposed method, 17 benchmark datasets from the UCI repository and four data mining classification algorithms were used: Naive Bayes, C4.5, k-nearest neighbor, and support vector machine. The experimental results of EF_Unique were compared with those obtained using three well-known discretization methods: unsupervised equal width (EW), unsupervised EF, and supervised entropy-based ID3 (EB-ID3). The results show that EF_Unique outperformed the EW, EF, and EB-ID3 discretization methods in 43, 41, and 27 of the 68 benchmark tests, respectively.
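To make the contrast between the unsupervised methods concrete, here is a minimal Python sketch of equal width and equal frequency cut points, plus one possible reading of "based on the unique values". The abstract does not give the EF_Unique algorithm, so `unique_value_cuts` is an assumption made for illustration only (equal frequency cuts computed over the distinct values), not the authors' published procedure. All function names are hypothetical.

```python
from bisect import bisect_right


def equal_width_cuts(values, k):
    """Return k-1 cut points that split [min, max] into k equal-width intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    return [lo + i * width for i in range(1, k)]


def equal_frequency_cuts(values, k):
    """Return k-1 cut points taken at evenly spaced ranks of the sorted values."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // k] for i in range(1, k)]


def unique_value_cuts(values, k):
    """ASSUMPTION, not the paper's algorithm: equal frequency cuts over distinct values."""
    return equal_frequency_cuts(set(values), k)


def discretize(values, cuts):
    """Map each value to the index of the interval it falls in."""
    return [bisect_right(cuts, v) for v in values]


# Toy attribute with many repeated values (illustrative data, not from the paper).
attribute = [1, 1, 1, 1, 1, 1, 2, 3, 4, 10]
for name, make_cuts in [
    ("equal width", equal_width_cuts),
    ("equal frequency", equal_frequency_cuts),
    ("unique-value variant", unique_value_cuts),
]:
    cuts = make_cuts(attribute, 2)
    print(name, cuts, discretize(attribute, cuts))
```

On this toy attribute the plain equal frequency cut lands on the heavily repeated value 1, so every value ends up in the same interval, while the cut computed over distinct values separates the low and high ends. This only illustrates why repeated values can matter for equal frequency binning; it says nothing about the benchmark results reported above.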

Journal: Arabian Journal for Science and Engineering
Publication Date: Mar 3, 2018
Citations: 9

Similar Papers

Performance Comparison of Equal Width and Equal Frequency Discretization Methods for Author’s Handwriting Recognition
Intan Ermahani A Jalil ... Sabrina Ahmad
01 Jan 2021

Unsupervised discretization method based on adjustable intervals
...
01 Jan 2012

A survey of data mining and knowledge discovery process models and methodologies
Gonzalo Mariscal ... Covadonga Fernández
The Knowledge Engineering Review | VOL. 25
01 Jun 2010

Evaluation of an integrated Knowledge Discovery and Data Mining process model
Sumana Sharma ... George M Kasper
Expert Systems With Applications | VOL. 39
20 Feb 2012
