Possibilistic Similarity Measures for Data Science and Machine Learning Applications

Amal Charfi,Wassim Bouchaala,Basel Solaiman,Nabil Derbel,Imene Khanfir Kallel,Eloi Bosse,Sonda Ammar Bouhamed

doi:10.1109/access.2020.2979553

Amal Charfi, Wassim Bouchaala + Show 5 more

Open Access

https://doi.org/10.1109/access.2020.2979553

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 37	License type: CC BY 4.0

Affiliation: University of Sfax, IMT Atlantique

Abstract

Measuring similarity is of a great interest in many research areas such as in data sciences, machine learning, pattern recognition, text analysis and information retrieval to name a few. Literature has shown that possibility is an attractive notion in the context of distinguishability assessment and can lead to very efficient and computationally inexpensive learning schemes. This paper focuses on determining the similarity between two possibility distributions. A review of existing similarity measures within the possibilistic framework is presented first. Then, similarity measures are analyzed with respect to their capacity to satisfy a set of required properties that a similarity measure should own. Most of the existing possibilistic similarity measures produce undesirable outcomes since they generally depend on the application context. A new similarity measure, called InfoSpecificity, is introduced and the similarity measures are categorized into three main methods: morphic-based, amorphic-based and hybrid. Two experiments are being conducted using four benchmark databases. The aim of the experiments is to compare the efficiency of the possibilistic similarity measures when applied to real data. Empirical experiments have shown good results for the hybrid methods, particularly with the InfoSpecificity measure. In general, the hybrid methods outperform the other two categories when evaluated on small-size samples, i.e., poor-data context (or poor-informed environment) where possibility theory can be used at the greatest benefit.

Highlights

Determining similarities is part of a fundamental process of a human sense-making mechanism that consists of three elements: an object or event, a mental model, and an association between them [1]
The notion of similarity has been exploited in various fields of Computer Sciences [2]–[5] such as in machine learning pattern recognition [4], classification [6], image processing [7] and decision making [5]
We propose to group the possibilistic similarity measures into three categories: those based on the evaluation of the morphic aspect, those based on the magnitude as amorphic-based ones, and the hybrid category that combines morphic and amorphic criteria to assess similarity

Summary

Introduction

Determining similarities is part of a fundamental process of a human sense-making mechanism that consists of three elements: an object or event, a mental model, and an association between them [1]. The notion of similarity has been exploited in various fields of Computer Sciences [2]–[5] such as in machine learning pattern recognition [4], classification [6], image processing [7] and decision making [5]. Similarity in a machine learning context is required to compute the ‘‘closeness’’ between elements in a dataset. It allows to understand the structure within the input data [8]. Refining the estimation of similarity scores leads to the improvement of algorithms accuracy as well as the minimization of errors and confusions

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Possibilistic Similarity Measures for Data Science and Machine Learning Applications

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Some Linear Diophantine Fuzzy Similarity Measures and Their Application in Decision Making Problem
Maha M Saeed Mohammad ... Saleem Abdullah
IEEE Access | VOL. 10
Maha M Saeed Mohammad, et. al.Maha M Saeed Mohammad ... Saleem Abdullah
01 Jan 2021
IEEE Access | VOL. 10

Spherical Fuzzy Sets-Based Cosine Similarity and Information Measures for Pattern Recognition and Medical Diagnosis
Tahir Mahmood ... Abdu Gumaei
IEEE Access | VOL. 9
Tahir Mahmood, et. al.Tahir Mahmood ... Abdu Gumaei
01 Jan 2020
IEEE Access | VOL. 9

Machine learning in pain research.
Jörn Lötsch ... Alfred Ultsch
Pain | VOL. 159
Jörn Lötsch, et. al.Jörn Lötsch ... Alfred Ultsch
24 Nov 2017
Pain | VOL. 159

A family of similarity measures for q‐rung orthopair fuzzy sets and their applications to multiple criteria decision making
Bahram Farhadinia ... Francisco Chiclana
International Journal of Intelligent Systems | VOL. 36
Bahram Farhadinia, et. al.Bahram Farhadinia ... Francisco Chiclana
04 Jan 2021
International Journal of Intelligent Systems | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Possibilistic Similarity Measures for Data Science and Machine Learning Applications

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access