Construction of weak and strong similarity measures for ordered sets of documents using fuzzy set techniques

L Egghe,C Michel

doi:10.1016/s0306-4573(02)00027-4

Abstract

Ordered sets of documents are encountered more and more in information distribution systems, such as information retrieval systems. Classical similarity measures for ordinary sets of documents hence need to be extended to these ordered sets. This is done in this paper using fuzzy set techniques. First a general similarity measure is developed which contains the classical strong similarity measures such as Jaccard, Dice, Cosine and which contains the classical weak similarity measures such as Recall and Precision. Then these measures are extended to comparing fuzzy sets of documents. Measuring the similarity for ordered sets of documents is a special case of this, where, the higher the rank of a document, the lower its weight is in the fuzzy set. Concrete forms of these similarity measures are presented. All these measures are new and the ones for the weak similarity measures are the first of this kind (other strong similarity measures have been given in a previous paper by Egghe and Michel). Some of these measures are then tested in the IR-system Profil-Doc. The engine SPIRIT © extracts ranked documents sets in three different contexts, each for 600 request. The practical useability of the OS-measures is then discussed based on these experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Construction of weak and strong similarity measures for ordered sets of documents using fuzzy set techniques

Abstract

Talk to us

Similar Papers

More From: Information Processing & Management

Lead the way for us

Journal: Information Processing & Management	Publication Date: May 28, 2002
Citations: 37

Similar Papers

A Bidirectional Subsethood Based Similarity Measure for Fuzzy Sets
Shaily Kabir ... Christian Wagner
-
Shaily Kabir, et. al.Shaily Kabir ... Christian Wagner
01 Jul 2018
01 Jul 2018

Cosine similarity and distance measures for [formula omitted] quasirung orthopair fuzzy sets: Applications in investment decision-making
Muhammad Rahim ... Thabet Abdeljawad
Heliyon | VOL. 10
Muhammad Rahim, et. al.Muhammad Rahim ... Thabet Abdeljawad
31 May 2024
Heliyon | VOL. 10

Emerging trends in soft set theory and related topics.
Feng Feng ... Violeta Leoreanu-Fotea
The Scientific World Journal | VOL. 2015
Feng Feng, et. al.Feng Feng ... Violeta Leoreanu-Fotea
01 Jan 2015
The Scientific World Journal | VOL. 2015

Studies on Fuzzy Information Measures
Shifei Ding ... Shixiong Xia
-
Shifei Ding, et. al.Shifei Ding ... Shixiong Xia
01 Jan 2007
01 Jan 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Construction of weak and strong similarity measures for ordered sets of documents using fuzzy set techniques

Abstract

Talk to us

Similar Papers

More From: Information Processing &amp; Management

More From: Information Processing & Management