A Comment on “A Similarity Measure for Text Classification and Clustering”

Naresh Kumar Nagwani

doi:10.1109/tkde.2015.2451616

Abstract

A similarity measure namely, similarity measure for text processing (SMTP) is proposed by Lin et al. [1] for knowledge discovery on text collection. The proposed measure considered the three cases for similarity measurements between the pairs of documents. These cases are based on absence and presence of features in the pair of text documents. The first case covers the features appearing in both of the documents, second case covers the features appears in only one document and the third case covers the features appears in none of the documents. The proposed similarity measure considered to be ideal for finding similarity between the pair of text documents on the basis of presence or absence of features available in text documents, however, while exploring the SMTP similarity measurement it is found that the case of measuring similarity between the pair of similar documents is not covered. The objective of this work is to highlight this gap and propose a minor change to make the SMTP a complete similarity measurement technique for knowledge discovery in line with the other standard similarity techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comment on “A Similarity Measure for Text Classification and Clustering”

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Sep 1, 2015
Citations: 22

Similar Papers

A new approach to construct similarity measure for intuitionistic fuzzy sets
Yafei Song ... Wenlong Huang
Soft Computing | VOL. 23
Yafei Song, et. al.Yafei Song ... Wenlong Huang
03 Nov 2017
Soft Computing | VOL. 23

New similarity measures for single-valued neutrosophic sets with applications in pattern recognition and medical diagnosis problems
Jia Syuen Chai ... Ganeshsree Selvachandran
Complex & Intelligent Systems | VOL. 7
Jia Syuen Chai, et. al.Jia Syuen Chai ... Ganeshsree Selvachandran
07 Dec 2020
Complex & Intelligent Systems | VOL. 7

An evidential view of similarity measure for Atanassov’s intuitionistic fuzzy sets
Yafei Song ... Lei Lei
Journal of Intelligent & Fuzzy Systems | VOL. 31
Yafei Song, et. al.Yafei Song ... Lei Lei
13 Aug 2016
Journal of Intelligent & Fuzzy Systems | VOL. 31

A novel similarity measure between Atanassov’s intuitionistic fuzzy sets based on transformation techniques with applications to pattern recognition
Shyi-Ming Chen ... Chia-Hao Chang
Information Sciences | VOL. 291
Shyi-Ming Chen, et. al.Shyi-Ming Chen ... Chia-Hao Chang
27 Aug 2014
Information Sciences | VOL. 291

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comment on “A Similarity Measure for Text Classification and Clustering”

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering