Development of Document Clustering Technique for Gurmukhi Script using Fuzzy Term Weight

Mukesh Kumar,Amandeep Verma

doi:10.35940/ijrte.b2386.078219

Abstract

Document clustering is an unsupervised machine learning technique which designates the creation of classes of a certain number of similar objects without prior knowledge of data-sets. These classes of similar objects are known as clusters; each cluster consists unlabeled data objects in such a way that data objects within the same cluster have maximum similarity and have dissimilarity to the data objects of other groups. The purpose of this research work is to develop domain independent Gurmukhi script clustering technique. It is the first ever effort as no prior work has been done to develop domain independent clustering technique for Gurmukhi script. In this paper, a hybrid algorithm for the development of document clustering technique for Gurmukhi script has been developed. The experimental results of proposed document clustering technique reveal that the proposed hybrid technique performs better in terms of defining number of clusters, creation of meaningful cluster titles, and in terms of performance regarding assignment of real time unlabeled data sets to the relevant cluster as a result of various pre-processing steps like segmentation, stemming, normalization as well as extraction of named/noun entities, creation of cluster titles and placing text documents into relevant clusters using fuzzy term weight.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Development of Document Clustering Technique for Gurmukhi Script using Fuzzy Term Weight

Abstract

Talk to us

Similar Papers

More From: International Journal of Recent Technology and Engineering (IJRTE)

Lead the way for us

Similar Papers

Automatic Scientific Document Clustering Using Self-organized Multi-objective Differential Evolution
Naveen Saini ... Pushpak Bhattacharyya
Cognitive Computation | VOL. 11
Naveen Saini, et. al.Naveen Saini ... Pushpak Bhattacharyya
19 Dec 2018
Cognitive Computation | VOL. 11

Comparing the Performance of SOM with Traditional Methods for Document Clustering Using Wordnet Ontologies
Abhishek Sawalkar ... Dr Ratnamala S Paswan
International Journal for Research in Applied Science and Engineering Technology | VOL. 10
Abhishek Sawalkar, et. al.Abhishek Sawalkar ... Dr Ratnamala S Paswan
30 Apr 2022
International Journal for Research in Applied Science and Engineering Technology | VOL. 10

A similarity assessment technique for effective grouping of documents
Tanmay Basu ... C.A Murthy
Information Sciences | VOL. 311
Tanmay Basu, et. al.Tanmay Basu ... C.A Murthy
21 Mar 2015
Information Sciences | VOL. 311

Clustering News Articles using Efficient Similarity Measure and N-grams
Rajesh Prasad ... Desmond Bisandu
International Journal of Knowledge Engineering and Data Mining | VOL. 5
Rajesh Prasad, et. al.Rajesh Prasad ... Desmond Bisandu
01 Jan 2018
International Journal of Knowledge Engineering and Data Mining | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development of Document Clustering Technique for Gurmukhi Script using Fuzzy Term Weight

Abstract

Talk to us

Similar Papers

More From: International Journal of Recent Technology and Engineering (IJRTE)