Comparative study on textual data set using fuzzy clustering algorithms

Rjiba Sadika,Saloua Benammou,Moez Soltani

doi:10.1108/k-11-2015-0301

Abstract

Purpose The purpose of this paper is to apply the Takagi-Sugeno (T-S) fuzzy model techniques in order to treat and classify textual data sets with and without noise. A comparative study is done in order to select the most accurate T-S algorithm in the textual data sets. Design/methodology/approach From a survey about what has been termed the “Tunisian Revolution,” the authors collect a textual data set from a questionnaire targeted at students. Five clustering algorithms are mainly applied: the Gath-Geva (G-G) algorithm, the modified G-G algorithm, the fuzzy c-means algorithm and the kernel fuzzy c-means algorithm. The authors examine the performances of the four clustering algorithms and select the most reliable one to cluster textual data. Findings The proposed methodology was to cluster textual data based on the T-S fuzzy model. On one hand, the results obtained using the T-S models are in the form of numerical relationships between selected keywords and the rest of words constituting a text. Consequently, it allows the authors to interpret these results not only qualitatively but also quantitatively. On the other hand, the proposed method is applied for clustering text taking into account the noise. Originality/value The originality comes from the fact that the authors validate some economical results based on textual data, even if they have not been written by experts in the linguistic fields. In addition, the results obtained in this study are easy and simple to interpret by the analysts.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparative study on textual data set using fuzzy clustering algorithms

Abstract

Talk to us

Similar Papers

More From: Kybernetes

Lead the way for us

Journal: Kybernetes	Publication Date: Sep 5, 2016
Citations: 5

Similar Papers

Clustering Approach Using Multiobjective Non-Dominated Sorting Teaching Learning Based Optimization with Kernel Fuzzy C-Means Algorithm (NSTLBO-KFCM)
Saumya Singh ... Smriti Srivastava
-
Saumya Singh, et. al.Saumya Singh ... Smriti Srivastava
01 May 2023
01 May 2023

An Adaptive Image Segmentation Method Based on Kernel FCM Algorithm
Huang Zhenhai ... Li Yuntang
-
Huang Zhenhai, et. al.Huang Zhenhai ... Li Yuntang
01 Jul 2016
01 Jul 2016

Neighbor sample membership weighted KFCM algorithm for remote sensing image classification
Xiao Wang ... Xiao-Fang Liu
-
Xiao Wang, et. al. Xiao Wang ... Xiao-Fang Liu
01 Dec 2012
01 Dec 2012

A new extension of fuzzy C-Means algorithm using non Euclidean distance and kernel methods
Bouzbida Mohamed ... Chaari Abdelkader
-
Bouzbida Mohamed, et. al.Bouzbida Mohamed ... Chaari Abdelkader
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparative study on textual data set using fuzzy clustering algorithms

Abstract

Talk to us

Similar Papers

More From: Kybernetes