A space-structure based dissimilarity measure for categorical data

Kevin Alejandro Hernández,D Cárdenas Peña,Álvaro A Orozco

doi:10.11591/ijece.v11i1.pp620-627

Abstract

The development of analysis methods for categorical data begun in 90's decade, and it has been booming in the last years. On the other hand, the performance of many of these methods depends on the used metric. Therefore, determining a dissimilarity measure for categorical data is one of the most attractive and recent challenges in data mining problems. However, several similarity/dissimilarity measures proposed in the literature have drawbacks due to high computational cost, or poor performance. For this reason, we propose a new distance metric for categorical data. We call it: Weighted pairing (W-P) based on feature space-structure, where the weights are understood like a degree of contribution of an attribute to the compact cluster structure. The performance of W-P metric was evaluated in the unsupervised learning framework in terms of cluster quality index. We test the W-P in six real categorical datasets downloaded from the public UCI repository, and we make a comparison with the distance metric (DM3) method and hamming metric (H-SBI). Results show that our proposal outperforms DM3 and H-SBI in different experimental configurations. Also, the W-P achieves highest rand index values and a better clustering discriminant than the other methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A space-structure based dissimilarity measure for categorical data

Abstract

Talk to us

Similar Papers

More From: International Journal of Electrical and Computer Engineering (IJECE)

Lead the way for us

Journal: International Journal of Electrical and Computer Engineering (IJECE)	Publication Date: Feb 1, 2021
License type: CC BY-SA 4.0

Similar Papers

Unsupervised Deep Feature Learning With Iteratively Refined Pseudo Classes for Scene Representation
Zhiqiang Gong ... Weidong Hu
IEEE Access | VOL. 7
Zhiqiang Gong, et. al.Zhiqiang Gong ... Weidong Hu
01 Jan 2019
IEEE Access | VOL. 7

Unsupervised Learning-Based Depth Estimation-Aided Visual SLAM Approach
Mingyang Geng ... Suning Shang
Circuits, Systems, and Signal Processing | VOL. 39
Mingyang Geng, et. al.Mingyang Geng ... Suning Shang
19 Jun 2019
Circuits, Systems, and Signal Processing | VOL. 39

Physics-informed Unsupervised Deep Learning Framework for Solving Full-Wave Inverse Scattering Problems
Che Liu ... Tiejun Cui
-
Che Liu, et. al.Che Liu ... Tiejun Cui
14 Dec 2022
14 Dec 2022

An unsupervised feature learning framework for basal cell carcinoma image analysis
John Arevalo ... Viviana Arias
Artificial Intelligence in Medicine | VOL. 64
John Arevalo, et. al.John Arevalo ... Viviana Arias
23 Apr 2015
Artificial Intelligence in Medicine | VOL. 64

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A space-structure based dissimilarity measure for categorical data

Abstract

Talk to us

Similar Papers

More From: International Journal of Electrical and Computer Engineering (IJECE)