Clustering of Social Networking Data Using SparkR in Big Data

Navneet Kaur,Niranjan Lal

doi:10.1007/978-981-13-1813-9_22

Abstract

Due to every day growing amount of data and changing the formats, the storing and management of these data is the challenging task for the organizations. Not long ago, datasets contained thousands of data items. Currently, different technologies can store, manage and process data with increasing volumes of unstructured and heterogeneous data, data of this type are known as Big Data. Big Data is the period for a group of such huge and complicated datasets that makes it problematic to store, manage and process with existing data processing tools. Now, in Big Data, maximum of the data created is not structured. Therefore, the new situations imposed by Big Data present grave challenges at multiple levels, together with clustering problem of these data. Clustering is one of the significant Big Data analysis problems, where very large amount of heterogeneous and unstructured data must be grouped together. Here we have describe the k-mean and hierarchical clustering methods; great attention to k-means method lends itself because it remains one of the most sought-after other approaches and it is also implemented in innovative technologies for analyzing Big Data. This paper describes different categories of data, the management of unstructured data in Big Data and the clustering analysis of social network data using SparkR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Clustering of Social Networking Data Using SparkR in Big Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Uncertain Unstructured Data Management: Put All Apple in Nine Basket
Manoj Kumar Jain
International Journal of Big Data Security Intelligence | VOL. 2
Manoj Kumar JainManoj Kumar Jain
30 Jun 2015
International Journal of Big Data Security Intelligence | VOL. 2

Framework and key technologies for big data based on manufacturing
Shan Ren ... Xin Zhao
-
Shan Ren, et. al.Shan Ren ... Xin Zhao
01 Jan 2015
01 Jan 2015

Multi-source heterogeneous Hakka culture heritage data management based on MongoDB
Qunyong Wu ... Ying Jiang
-
Qunyong Wu, et. al.Qunyong Wu ... Ying Jiang
01 Jul 2016
01 Jul 2016

Structured and Unstructured Big Data Analytics
Suyash Mishra ... Anuranjan Misra
-
Suyash Mishra, et. al.Suyash Mishra ... Anuranjan Misra
01 Sep 2017
01 Sep 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clustering of Social Networking Data Using SparkR in Big Data

Abstract

Talk to us

Similar Papers