Clustering Social Networking Data With K-Means Algorithm Using R Language

Sujeet Kumar Sahani, Dr Sonam Singh

doi:10.32628/cseit24104105

Open Access

https://doi.org/10.32628/cseit24104105

Journal: International Journal of Scientific Research in Computer Science, Engineering and Information Technology
Publication Date: Jul 1, 2024
License type: CC BY 4.0

#Large Network Datasets #Sequential Algorithms

Abstract

The main objective of this research is to report detailed empirical studies of sequential and parallel algorithms for diverse clustering tasks, executed on very large social network datasets using memory-efficient out-of-core approaches. We evaluate the Spark implementation for R on Cloudera, applying algorithms such as k-means and hierarchical clustering to social media review datasets in order to rank these algorithms. The implementation uses the YouTube dataset from the UCI Machine Learning Repository. Our goal is to compare a few algorithms and measure how accurately each model performs. Ultimately, we aim to test and rank clustering methods, and to mine and cluster massive amounts of unstructured data.

Full Text