An Improved Robust Fuzzy Algorithm for Unsupervised Learning

Amina Dik,Khalid Jebari,Aziz Ettouhami

doi:10.1515/jisys-2018-0030

Amina Dik, Khalid Jebari + Show 1 more

Open Access

https://doi.org/10.1515/jisys-2018-0030

Copy DOI

Abstract

Abstract This paper presents a robust, dynamic, and unsupervised fuzzy learning algorithm (RDUFL) that aims to cluster a set of data samples with the ability to detect outliers and assign the numbers of clusters automatically. It consists of three main stages. The first (1) stage is a pre-processing method in which possible outliers are determined and quarantined using a concept of proximity degree. The second (2) stage is a learning method, which consists in auto-detecting the number of classes with their prototypes for a dynamic threshold. This threshold is automatically determined based on the similarity among the detected prototypes that are updated at the exploration of a new data. The last (3) stage treats quarantined samples detected from the first stage to determine whether they belong to some class defined in the second phase. The effectiveness of this method is assessed on eight real medical benchmark datasets in comparison to known unsupervised learning methods, namely, the fuzzy c-means (FCM), possibilistic c-means (PCM), and noise clustering (NC). The obtained accuracy of our scheme is very promising for unsupervised learning problems.

Highlights

Clustering is one of the most relevant data-mining tasks [42]
The last (3) stage treats quarantined samples detected from the first stage to determine whether they belong to some class defined in the second phase. The effectiveness of this method is assessed on eight real medical benchmark datasets in comparison to known unsupervised learning methods, namely, the fuzzy c-means (FCM), possibilistic c-means (PCM), and noise clustering (NC)
To assess the performance of our approach, some experiments were conducted on an artificial dataset X1, and on eight real-world databases that are available in UCI [8]: Lymphography, Diabetes, Indian, Haberman’s Survival, BCW, Post-operative Patient, Parkinsons, and EEG Eyes State

Summary

Introduction

Clustering is one of the most relevant data-mining tasks [42]. It is the process of organizing objects into a set of classes. We propose a robust approach, which allows clustering data by auto-detecting the classes they form and providing the existing outliers without giving any parameter. The proposed approach consists of three stages: – A pre-processing stage using similarity to detect objects likely to be outliers and which will be considered as possible outliers These objects are quarantined and excluded from the second stage. – A second stage in which classes are determined based on a dynamic threshold This threshold is based on the minimum similarity among the detected prototypes, which are updated at the exploration of any new object. – A final stage, which is a processing of possible outliers in order to determine whether they belong to one of these classes detected in the second phase.

Related Work

The PCM Algorithm

The Robust-FCM Algorithm

Learning Phase

Treatment of Possible Outliers

Results and Discussion

Artificial Dataset

Real-World Dataset

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Intelligent Systems	Publication Date: Oct 25, 2018
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Improved Robust Fuzzy Algorithm for Unsupervised Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Intelligent Systems

Lead the way for us

Similar Papers

Unsupervised Machine Learning Methods for Artifact Removal in Electrodermal Activity.
Sandya Subramanian ... Emery N Brown
Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference | VOL. 2021
Sandya Subramanian, et. al.Sandya Subramanian ... Emery N Brown
01 Nov 2021
01 Nov 2021

Unsupervised learning on scientific ocean drilling datasets from the South China Sea
...
Frontiers of earth science | VOL. 13
, et. al. ...
04 Jun 2018
Frontiers of earth science | VOL. 13

An unsupervised learning approach to study synchroneity of past events in the South China Sea
Kevin C Tse ... Man-Yin Tsang
Frontiers of earth science | VOL. 13
Kevin C Tse, et. al.Kevin C Tse ... Man-Yin Tsang
08 Aug 2019
Frontiers of earth science | VOL. 13

Identification of multi-element geochemical anomalies using unsupervised machine learning algorithms: A case study from Ag–Pb–Zn deposits in north-western Zhejiang, China
Jun Wang ... Fan Xiao
Applied Geochemistry | VOL. 120
Jun Wang, et. al.Jun Wang ... Fan Xiao
11 Jul 2020
Applied Geochemistry | VOL. 120

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Improved Robust Fuzzy Algorithm for Unsupervised Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Intelligent Systems