Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Özge Pasin, Handan Ankaralı

doi:10.18535/ijmsci/v4i3.08

Abstract

More than 50 types of clustering algorithms have been developed to extract meaningful information from large data sets and to group individuals according to their characteristics. In practice, research data often contain several types of variables, so it is important to select a clustering algorithm that suits the data types involved. In this study, we first describe the EM (Expectation Maximization) and Two-Step clustering methods, which were developed in recent years and are among the best methods for data sets containing mixed types of variables. The second aim is to compare the two methods on a data set built from health-field information. These algorithms are generally recommended for large data sets, but they are also used on medium-sized data sets, which are more common in practice. Therefore, fifty control subjects and fifty patients with polycystic ovary syndrome were included in the study. Nineteen variables were measured on these subjects: thirteen quantitative and six qualitative. Clusters were obtained with the EM and Two-Step cluster methods. Agreement between the clusters produced by each algorithm and the known patient and control groups was evaluated with the Kappa coefficient. The EM clustering algorithm showed higher agreement than Two-Step clustering (Kappa=0,740;p
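The agreement step described in the abstract can be illustrated with a minimal sketch. It assumes two known groups and two clusters, and it computes Cohen's kappa after trying both possible ways of matching cluster identifiers to groups, since cluster identifiers are arbitrary. The function names and the ten-subject toy data are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)
    # Observed proportion of subjects on which the two labelings agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from the marginal label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[k] * count_b[k] for k in set(count_a) | set(count_b)) / (n * n)
    if expected == 1.0:
        return 1.0  # both labelings use a single identical label
    return (observed - expected) / (1.0 - expected)


def kappa_for_two_clusters(groups, clusters):
    """Kappa between known groups and cluster ids, for exactly two of each.

    Cluster ids carry no meaning, so both possible id-to-group mappings
    are tried and the higher kappa is returned.
    """
    group_names = sorted(set(groups))
    cluster_ids = sorted(set(clusters))
    if len(group_names) != 2 or len(cluster_ids) != 2:
        raise ValueError("this sketch handles exactly two groups and two clusters")
    best = -1.0
    for mapping in (dict(zip(cluster_ids, group_names)),
                    dict(zip(cluster_ids, reversed(group_names)))):
        predicted = [mapping[c] for c in clusters]
        best = max(best, cohen_kappa(groups, predicted))
    return best


# Hypothetical toy data: 5 patients and 5 controls, one patient misclustered.
toy_groups = ["patient"] * 5 + ["control"] * 5
toy_clusters = [0, 0, 0, 0, 1, 1, 1, 1, 1, 1]

if __name__ == "__main__":
    print(kappa_for_two_clusters(toy_groups, toy_clusters))
```

With more than two clusters, trying every mapping becomes impractical, and a matching procedure on the contingency table would be a more suitable choice.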

Journal: International Journal Of Medical Science And Clinical Invention
Publication Date: Mar 19, 2017
Citations: 1
