A survey of document clustering using semantic approach

Nagma Y Saiyad,Harshadkumar B Prajapati,Vipul K Dabhi

doi:10.1109/iceeot.2016.7755154

Abstract

Document clustering is the application of cluster analysis to textual documents. It is commonly used technique in data mining, information retrieval, knowledge discovery from data, pattern recognition, etc. In traditional document clustering, a document is considered as a bag of words; where semantic meaning of word is not taken into consideration. However, to achieve accurate document clustering, feature such as meanings of the words is important. Document clustering can be done using semantic approach because it takes semantic relationship among words into account. This paper highlights the problems in traditional approach as well as semantic approach. This paper identifies four major areas under semantic clustering and presents a survey of 23 papers that are studied, covering major significant work. Moreover, this paper also provides a survey of tools specifically used for text processing, and clustering algorithms, that help in applying and evaluating document clustering. The presented survey is used in preparing the proposed work in the same direction. This proposed work uses the sense of a word for text clustering system. Lexical chains will be used as features that are to be developed using the identity/synonymy relation from WordNet ontology as background knowledge. Later, clustering will be done using the lexical chains.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A survey of document clustering using semantic approach

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A survey on semantic document clustering
Maitri P Naik ... Vipul K Dabhi
-
Maitri P Naik, et. al.Maitri P Naik ... Vipul K Dabhi
01 Mar 2015
01 Mar 2015

Text document clustering based on frequent word meaning sequences
Yanjun Li ... Soon M Chung
Data & Knowledge Engineering | VOL. 64
Yanjun Li, et. al.Yanjun Li ... Soon M Chung
30 Aug 2007
Data & Knowledge Engineering | VOL. 64

A survey on methodologies used for semantic document clustering
Aditi Gupta ... Ajay Kumar
-
Aditi Gupta, et. al.Aditi Gupta ... Ajay Kumar
01 Aug 2017
01 Aug 2017

Semantic based Document Clustering: A Detailed Review
Neepa Shah ... Sunita Mahajan
International Journal of Computer Applications | VOL. 52
Neepa Shah, et. al.Neepa Shah ... Sunita Mahajan
30 Aug 2012
International Journal of Computer Applications | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A survey of document clustering using semantic approach

Abstract

Talk to us

Similar Papers