COVID-19 Dataset Clustering based on K-Means and EM Algorithms

Youssef Boutazart, Khaled Satori, Hassan Satori, Anselme R Affane M, Mohamed Hamidi

doi:10.14569/ijacsa.2023.01403105

Open Access

https://doi.org/10.14569/ijacsa.2023.01403105

Abstract

This paper analyzes a COVID-19 dataset by clustering it with a combination of the K-Means and Expectation-Maximization (EM) algorithms, with the aim of gaining insight into and interpreting the various components of the data. The study tracks the evolution of confirmed, death, and recovered cases from March to October 2020, using a two-dimensional dataset approach. K-Means is used to group the data into three categories, “Confirmed-Recovered”, “Confirmed-Death”, and “Recovered-Death”, and each category is modeled using a bivariate Gaussian density. The optimal number of groups, k, is determined using the Elbow method. The results indicate that the clusters generated by K-Means provide limited information, whereas the EM algorithm reveals the correlation between “Confirmed-Recovered”, “Confirmed-Death”, and “Recovered-Death”. The reported advantages of the EM algorithm are stable computation and improved clustering through the Gaussian Mixture Model (GMM).
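The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' code and does not use their dataset: it assumes each pairing is treated as a set of 2-D points, uses synthetic data in place of the COVID-19 counts, and all function names (`kmeans`, `gauss2d`, `em_gmm`) are hypothetical. It computes the K-Means inertia for several values of k (the quantity plotted in the Elbow method), then runs EM for a bivariate Gaussian mixture initialized from the K-Means centroids and reads the per-component correlation off the fitted covariances.

```python
import math
import random

def kmeans(points, k, iters=50, seed=0):
    """Lloyd's algorithm on 2-D points; returns (centroids, inertia)."""
    centroids = random.Random(seed).sample(points, k)
    for _ in range(iters):
        # Assignment step: index of the nearest centroid for every point.
        labels = [min(range(k), key=lambda j: (p[0] - centroids[j][0]) ** 2 +
                                              (p[1] - centroids[j][1]) ** 2)
                  for p in points]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = (sum(p[0] for p in members) / len(members),
                                sum(p[1] for p in members) / len(members))
    # Inertia: sum of squared distances to the nearest centroid.
    inertia = sum(min((p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids)
                  for p in points)
    return centroids, inertia

def gauss2d(p, mean, cov):
    """Bivariate Gaussian density; cov is stored as (sxx, sxy, syy)."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    dx, dy = p[0] - mean[0], p[1] - mean[1]
    q = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    return math.exp(-0.5 * q) / (2 * math.pi * math.sqrt(det))

def em_gmm(points, means, iters=100, reg=1e-6):
    """EM for a bivariate Gaussian mixture, starting from the given means."""
    n, k = len(points), len(means)
    gx = sum(p[0] for p in points) / n
    gy = sum(p[1] for p in points) / n
    # Every component starts from the overall diagonal data variance.
    start = (sum((p[0] - gx) ** 2 for p in points) / n, 0.0,
             sum((p[1] - gy) ** 2 for p in points) / n)
    weights, covs = [1.0 / k] * k, [start] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each point.
        resp = []
        for p in points:
            dens = [w * gauss2d(p, m, c) for w, m, c in zip(weights, means, covs)]
            total = sum(dens)
            resp.append([d / total for d in dens])
        # M-step: re-estimate weights, means and covariances.
        weights, means, covs = [], [], []
        for j in range(k):
            r = [row[j] for row in resp]
            nj = sum(r)
            mx = sum(ri * p[0] for ri, p in zip(r, points)) / nj
            my = sum(ri * p[1] for ri, p in zip(r, points)) / nj
            sxx = sum(ri * (p[0] - mx) ** 2 for ri, p in zip(r, points)) / nj + reg
            syy = sum(ri * (p[1] - my) ** 2 for ri, p in zip(r, points)) / nj + reg
            sxy = sum(ri * (p[0] - mx) * (p[1] - my) for ri, p in zip(r, points)) / nj
            weights.append(nj / n)
            means.append((mx, my))
            covs.append((sxx, sxy, syy))
    return weights, means, covs

# Synthetic stand-in data (an assumption, not the paper's dataset):
# two groups, one positively and one negatively correlated.
rng = random.Random(42)
points = []
for _ in range(200):
    x = rng.gauss(0, 1)
    points.append((x, 0.9 * x + 0.3 * rng.gauss(0, 1)))
    u = rng.gauss(0, 1)
    points.append((8 + u, 3 - 0.8 * u + 0.4 * rng.gauss(0, 1)))

# Elbow method input: inertia as a function of k.
inertias = {k: kmeans(points, k)[1] for k in range(1, 6)}
centroids, _ = kmeans(points, 2)
weights, means, covs = em_gmm(points, centroids)
# Correlation coefficient of each fitted component.
corrs = [sxy / math.sqrt(sxx * syy) for sxx, sxy, syy in covs]
print(inertias)
print(corrs)
```

In this sketch K-Means yields only centroids and hard assignments, while the fitted GMM covariances expose the sign and strength of the correlation inside each component, which is the kind of extra information the abstract attributes to EM. For data on very different scales, a log-space E-step would be a safer choice than the direct densities used here.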

Journal: International Journal of Advanced Computer Science and Applications
Publication Date: Jan 1, 2023
Citations: 1
License type: cc-by

Similar Papers

Parameter estimation of Gaussian mixture models (GMM) with expectation maximization (EM) algorithm
Wardatul Jannah ... Dewi R S Saputro
01 Jan 2021

Application of EM algorithms for seismic facies classification
Mei Han ... Yong Zhao
Computational Geosciences | VOL. 15
23 Oct 2010

Probabilistic mapping networks for speaker recognition
Haizhou Li ... J.-P Haton
07 May 1996

Improving accuracy of Gaussian mixture model classifiers with additional discriminative training
Ajay M Patrikar ... John P Baker
01 Jul 2016
