Document Clustering: A Detailed Review

Neepa Shah,Sunita Mahajan

doi:10.5120/ijais12-450691

Abstract

Document clustering is automatic organization of documents into clusters so that documents within a cluster have high similarity in comparison to documents in other clusters. It has been studied intensively because of its wide applicability in various areas such as web mining, search engines, and information retrieval. It is measuring similarity between documents and grouping similar documents together. It provides efficient representation and visualization of the documents; thus helps in easy navigation also. In this paper, we have given overview of various document clustering methods studied and researched since last few years, starting from basic traditional methods to fuzzy based, genetic, coclustering, heuristic oriented etc. Also, the document clustering procedure with feature selection process, applications, challenges in document clustering, similarity measures and evaluation of document clustering algorithm is explained.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Document Clustering: A Detailed Review

Abstract

Talk to us

Similar Papers

More From: International Journal of Applied Information Systems

Lead the way for us

Journal: International Journal of Applied Information Systems	Publication Date: Oct 10, 2012
Citations: 69

Similar Papers

Investigate the Performance of Document Clustering Approach Based on Association Rules Mining
Noha Negm ... Passent Elkafrawy
International Journal of Advanced Computer Science and Applications | VOL. 4
Noha Negm, et. al.Noha Negm ... Passent Elkafrawy
01 Jan 2013
International Journal of Advanced Computer Science and Applications | VOL. 4

A Fuzzy based Document Clustering Algorithm
A Kakoti ... Kabita Thaoroijam
International Journal of Computer Applications | VOL. 151
A Kakoti, et. al.A Kakoti ... Kabita Thaoroijam
17 Oct 2016
International Journal of Computer Applications | VOL. 151

An Improved Document Clustering Approach with Multi-Viewpoint Based on Different Similarity Measures
Aniali Gunta ... Rahul Dubey
-
Aniali Gunta, et. al.Aniali Gunta ... Rahul Dubey
01 Jun 2018
01 Jun 2018

Mastering Web Mining and Information Retrieval in the Digital Age
Kijpokin Kasemsap
-
Kijpokin KasemsapKijpokin Kasemsap
01 Jan 2017
01 Jan 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Document Clustering: A Detailed Review

Abstract

Talk to us

Similar Papers

More From: International Journal of Applied Information Systems