A Well Organized Phrase-Based Document Clustering Using ASCII Values and Adjacency List

Srikanth Lukka,Rizwana Shaik

doi:10.1007/978-3-319-60618-7_12

Abstract

Document Clustering is the process of collecting similar kind of documents into one group based on any particular similarity function. Document clustering is also referred as text clustering. Informative features like phrases and their weights are considered to be more important to perform efficient document clustering. This paper mainly deals on two key parts for achieving efficient document clustering. The first part is a phrase based document model named as the Document Adjacency List, it explains about the construction of a phrase based model of the document set. It produces efficient phrase matching which is useful to decide the similarity among the documents. The second part is the document clustering algorithm that is proposed to enhance the Document Adjacency List for clustering based on the similarity measure. The combination of the above two parts leads to better calculation of similarity among documents and similarity further helps to calculate document clustering.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Well Organized Phrase-Based Document Clustering Using ASCII Values and Adjacency List

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Document Clustering - Concepts, Metrics and Algorithms
Tomasz Tarczynski
International Journal of Electronics and Telecommunications | VOL. 57
Tomasz TarczynskiTomasz Tarczynski
01 Sep 2011
International Journal of Electronics and Telecommunications | VOL. 57

Exploring diseases based biomedical document clustering and visualization using self-organizing maps
Setu Shah ... Xiao Luo
-
Setu Shah, et. al.Setu Shah ... Xiao Luo
01 Oct 2017
01 Oct 2017

Web Document Clustering Using Document Index Graph
B F Momin ... Amol Chaudhari
-
B F Momin, et. al.B F Momin ... Amol Chaudhari
01 Dec 2006
01 Dec 2006

Efficient phrase-based document indexing for Web document clustering
K.M Hammouda ... M.S Kamel
IEEE Transactions on Knowledge and Data Engineering | VOL. 16
K.M Hammouda, et. al.K.M Hammouda ... M.S Kamel
01 Oct 2004
IEEE Transactions on Knowledge and Data Engineering | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Well Organized Phrase-Based Document Clustering Using ASCII Values and Adjacency List

Abstract

Talk to us

Similar Papers