Abstract
Kernel-based data transformation model for nonlinear classification of symbolic data
Highlights
Symbolic data, alternatively known as categorical data or nominal data, are widely used in real-world applications, where the attributes are represented by symbols, which are qualitative categories of things [1]
In the K2NN algorithm [38], which extends the conventional k-nearest neighbors (KNN) classifier, a weighted simple matching (SM) distance measure was derived based on kernel density estimation (KDE) on symbolic data; in [39], three new linear classifiers were defined for symbolic data classification and, interestingly, it was demonstrated that the classes can be made more separable by kernel learning of symbolic attributes
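The exact weighting scheme of K2NN is not reproduced here, but the underlying idea of a KNN classifier over a weighted simple matching distance can be sketched as follows. The function names and the fixed per-attribute weights are illustrative assumptions, not the paper's derivation (where the weights come from KDE on the symbolic attributes):

```python
from collections import Counter

def weighted_sm_distance(x, y, weights):
    # Weighted simple matching: sum the weights of attributes whose
    # symbols differ between the two objects (assumed weights here;
    # K2NN derives them from kernel density estimation).
    return sum(w for xi, yi, w in zip(x, y, weights) if xi != yi)

def knn_predict(query, data, labels, weights, k=3):
    # Rank training objects by weighted SM distance to the query,
    # then take a majority vote among the k nearest.
    ranked = sorted(range(len(data)),
                    key=lambda i: weighted_sm_distance(query, data[i], weights))
    votes = Counter(labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]
```

With uniform weights this reduces to plain SM-distance KNN; non-uniform weights let informative attributes dominate the neighborhood ranking.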
This subsection aims to derive a Support Vector Machine (SVM) for non-linear classification of symbolic data, named SVM-S, using our new data transformation model KDTM and the inner product and distance measures formulated in the previous subsections
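The KDTM-induced inner product is defined in the paper itself; as a hypothetical stand-in, one way to obtain a kernel matrix directly from symbolic objects is to exponentiate a (negated) simple matching distance, which an SVM solver accepting precomputed Gram matrices could then consume. This is an illustrative sketch, not the paper's SVM-S formulation:

```python
import math

def sm_distance(x, y):
    # Plain simple matching distance: count of mismatched attributes.
    return sum(1 for xi, yi in zip(x, y) if xi != yi)

def symbolic_rbf_kernel(X, gamma=1.0):
    # Gram matrix K[i][j] = exp(-gamma * SM(x_i, x_j)); this is a
    # product of per-attribute PSD kernels, hence a valid Mercer kernel,
    # though not the KDTM inner product derived in the paper.
    n = len(X)
    return [[math.exp(-gamma * sm_distance(X[i], X[j]))
             for j in range(n)] for i in range(n)]
```

Such a Gram matrix is symmetric with unit diagonal, so it can be passed to any SVM implementation that supports precomputed kernels.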
Summary
Symbolic data, alternatively known as categorical data or nominal data, are widely used in real-world applications, where the attributes are represented by symbols, which are qualitative categories of things [1]. A number of methods have been developed to classify symbolic data, including decision trees (DT), Naive Bayes (NB) [9], and distance-based methods such as the k-nearest neighbors (KNN) and prototype-based classifiers [10, 11]. Since both DT and NB are typically based on the assumption that symbolic attributes are conditionally independent given the class attribute, they cannot identify the non-linear correlation between attributes, which has been validated to be useful in high-quality classification [12, 13]. The non-linear Support Vector Machine (SVM) [18] makes use of Mercer kernel functions to embed raw objects into a reproducing kernel Hilbert space, such that the data can be classified in the new space with high quality. Such a method cannot be directly applied to non-linear symbolic data classification because, essentially, it is designed for numeric data, where the Mercer kernels and some key intermediate operations, such as the inner product, are well-defined. A popular solution to this problem is to transform symbolic data into numeric data as a preprocessing step, using a frequency estimation-based encoding model such as the well-known One-Hot encoding.
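The one-hot preprocessing mentioned above can be sketched in a few lines: each symbolic attribute is expanded into an indicator vector over its observed categories, and the vectors are concatenated. This is a minimal self-contained version (libraries such as scikit-learn provide equivalent encoders):

```python
def one_hot_encode(X):
    # Build the category vocabulary of each attribute, then map every
    # symbol to a 0/1 indicator vector and concatenate across attributes.
    n_attrs = len(X[0])
    cats = [sorted({row[j] for row in X}) for j in range(n_attrs)]
    encoded = []
    for row in X:
        vec = []
        for j, v in enumerate(row):
            vec.extend(1.0 if v == c else 0.0 for c in cats[j])
        encoded.append(vec)
    return encoded
```

The resulting numeric vectors can be fed to any standard SVM, at the cost of losing the relations between categories that kernel-based transformations aim to preserve.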