Discretization of Unlabeled Data using RST & Clustering

Girish Kumar Singh ,Shrabanti Mandal

doi:10.26713/jims.v11i1.890

Abstract

An algorithm can be applied on numerical or continuous attributes as well as on nominal or discrete value. If input to an algorithm required only attributes of nominal or discrete type then continuous attributes of the dataset need to be discretize before applying such algorithm. Discretization method can be of two types namely supervised and unsupervised. Supervised methods of dicretization utilize class labels of the dataset while in unsupervised method class labels are totally disregarded. In many literatures it has been shown that supervised methods gives good discretization result. Supervised algorithms cannot apply if dataset is unlabeled. In real life, many dataset do not have class (label) attribute and only unsupervised discretization methods are applicable in such cases. This paper presents discretization schemes for unlabeled data based on RST (Rough Set Theory) and clustering. The experiments have been performed to compare the proposed technique with other discretization methods for labeled data on two benchmark datasets. Two parameters Class-Attribute Interdependence Redundancy and the total number of intervals have been used to compare the proposed techniques with other existing techniques. The results display a satisfactory tradeoff between the information loss and number of intervals for the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discretization of Unlabeled Data using RST & Clustering

Abstract

Talk to us

Similar Papers

More From: Journal of Informatics and Mathematical Sciences

Lead the way for us

Similar Papers

LFIT: an unsupervised discretization method based on the Ramer–Douglas–Peucker algorithm
Alev Mutlu ... Furkan Göz
TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES | VOL. 27
Alev Mutlu, et. al.Alev Mutlu ... Furkan Göz
15 May 2019
TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES | VOL. 27

An Approach to Educational Data Mining Model Accuracy Improvement Using Histogram Discretization and Combining Classifiers into an Ensemble
Gabrijela Dimić ... Dejan Rančić
-
Gabrijela Dimić, et. al.Gabrijela Dimić ... Dejan Rančić
01 Jan 2019
01 Jan 2019

Stability and Physical Accuracy Analysis of the Numerical Solutions to Wigner-Poisson Modeling of Resonant Tunneling Diodes
Boris Gelmont ... Tatiana Globus
-
Boris Gelmont, et. al.Boris Gelmont ... Tatiana Globus
22 Mar 2013
22 Mar 2013

Association Rule Mining for Continuous Attributes using Genetic Network Programming
Karla Taboada ... Eloy Gonzales
IEEJ Transactions on Electrical and Electronic Engineering | VOL. 3
Karla Taboada, et. al.Karla Taboada ... Eloy Gonzales
22 Feb 2008
IEEJ Transactions on Electrical and Electronic Engineering | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discretization of Unlabeled Data using RST & Clustering

Abstract

Talk to us

Similar Papers

More From: Journal of Informatics and Mathematical Sciences