Abstract

We present a new classification algorithm for machine learning on numerical data based on direct and inverse fuzzy transforms. In our previous work, fuzzy transforms were used for numerical attribute dependency in data analysis: the multi-dimensional inverse fuzzy transform was used to approximate the regression function. The classification method presented here is also based on this operator. Specifically, we apply the K-fold cross-validation algorithm to control the presence of over-fitting and to estimate the accuracy of the classification model: for each training (resp., testing) subset, an iterative process evaluates the best fuzzy partitions of the inputs. Finally, a weighted mean of the multi-dimensional inverse fuzzy transforms calculated for each training (resp., testing) subset is used for data classification. We compare this algorithm with five other classification methods on well-known datasets.

Highlights

  • In this paper we propose a new classification algorithm, called multi-dimensional F-transform classification, in which the direct and inverse multi-dimensional fuzzy transforms are used to classify instances

  • Our goal is to build a robust classification model based on the multi-dimensional fuzzy transform; it integrates a K-fold cross-validation resampling method to overcome data over-fitting, one of the major problems of classification models

  • We present our experiments on a sample of over 100 known datasets extracted from the UCI Machine Learning repository and from the Knowledge Extraction based on Evolutionary Learning (KEEL) dataset repository


Summary

Introduction

In this paper we propose a new classification algorithm, called multi-dimensional F-transform classification (for short, MFC), in which the direct and inverse multi-dimensional fuzzy transforms are used to classify instances. Our goal is to build a robust classification model based on the multi-dimensional fuzzy transform; it integrates a K-fold cross-validation resampling method to overcome data over-fitting, one of the major problems of classification models. The presence of under-fitting is evaluated by measuring a performance index after the learning process. Like under-fitting, over-fitting is a problem: it occurs when a machine learning algorithm captures noise in the data, so that the training data are fitted almost perfectly but the resulting model generalizes poorly to unseen data. There are two main techniques to limit over-fitting in machine learning algorithms: measuring accuracy on a separate validation dataset, or adopting a resampling technique that uses random subsets of the data.
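The resampling technique mentioned above can be sketched as a plain K-fold loop: shuffle the data, split it into K folds, train on K-1 folds, and score on the held-out fold. This is a generic illustration of K-fold cross-validation, not the paper's exact MFC procedure; the helper names and the toy classifier are assumptions.

```python
import numpy as np

def kfold_indices(n_samples, k, seed=0):
    """Shuffle sample indices and split them into k roughly equal folds."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n_samples), k)

def cross_validate(X, y, fit, score, k=5):
    """K-fold resampling: train on k-1 folds, evaluate on the held-out fold."""
    folds = kfold_indices(len(X), k)
    scores = []
    for i in range(k):
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        model = fit(X[train_idx], y[train_idx])
        scores.append(score(model, X[folds[i]], y[folds[i]]))
    return float(np.mean(scores))

# toy example: a constant "majority class" classifier on a balanced dataset
X = np.arange(20).reshape(-1, 1)
y = np.array([0] * 10 + [1] * 10)
fit = lambda Xtr, ytr: int(round(float(np.mean(ytr))))  # learned class label
score = lambda m, Xte, yte: float(np.mean(yte == m))    # accuracy on held-out fold
mean_accuracy = cross_validate(X, y, fit, score, k=5)
```

Averaging the per-fold scores gives an accuracy estimate that uses every sample for both training and testing, which is what makes the technique effective against over-fitting.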

The MFC algorithm
Multi‐dimensional F‐transforms
Proposed algorithm
Tests and results
Conclusions
