Soft Set Multivariate Distribution for Categorical Data Clustering

Iwan Tri Riyadi Yanto,Rohmat Saedudin,Sely Novita Sari,Norhalina Senan,Mustafa Mat Deris

doi:10.18517/ijaseit.11.5.15420

Iwan Tri Riyadi Yanto, Rohmat Saedudin + Show 3 more

Open Access

https://doi.org/10.18517/ijaseit.11.5.15420

Copy DOI

Abstract

<p class='IJASEITAbtract'>Clustering is the process of breaking down a huge dataset into smaller groups. It has been used in some field studies including pattern recognition, segmentation, and statistics with remarkable success. Clustering is a technique for dividing multivariate datasets into groups. No inherent distance measure on data category makes clustering data more challenging than numerical data. Data category can be assumed following the data from a multinomial distribution. Thus, the standard model parametric model can be used in latent class clustering based on the independent product of multinomial distributions. Meanwhile, multi-valued attributes on the categorical data can be decomposed into the standard set on a multi soft set. In this paper, a clustering technique based on soft set theory is proposed for categorical data through a multinomial distribution. The data will be represented as a multi soft set which is every soft set has its probability of being a member of the cluster. The data with the highest probability will be assigned as the member of the cluster. The experiment of the proposed technique is evaluated based on the Dunn index with regard to the number of clusters and response time. The experiment results show that the proposed technique has the lowest response time with high stability compared to baseline techniques. This study recommends a maximum number of clusters in implementation on the real data.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal on Advanced Science, Engineering and Information Technology	Publication Date: Oct 17, 2021
Citations: 1	License type: cc-by-sa

R Discovery Prime

R Discovery Prime

Soft Set Multivariate Distribution for Categorical Data Clustering

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology

Lead the way for us

Similar Papers

Emerging trends in soft set theory and related topics.
Feng Feng ... Muhammad Akram
TheScientificWorldJournal | VOL. 2015
Feng Feng, et. al.Feng Feng ... Muhammad Akram
01 Jan 2015
TheScientificWorldJournal | VOL. 2015

The relationship among soft sets, soft rough sets and topologies
Zhaowen Li ... Tusheng Xie
Soft Computing | VOL. 18
Zhaowen Li, et. al.Zhaowen Li ... Tusheng Xie
27 Aug 2013
Soft Computing | VOL. 18

Soft sets and soft rough sets
Xiaoyan Liu ... Young Bae Jun
Information Sciences | VOL. 181
Xiaoyan Liu, et. al.Xiaoyan Liu ... Young Bae Jun
18 Nov 2010
Information Sciences | VOL. 181

Soft Rough Approximation Operators and Related Results
Zhangyong Cai ... Zhaowen Li
Journal of Applied Mathematics | VOL. 2013
Zhangyong Cai, et. al.Zhangyong Cai ... Zhaowen Li
01 Jan 2013
Journal of Applied Mathematics | VOL. 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Soft Set Multivariate Distribution for Categorical Data Clustering

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology