MapReduce Programming Model Research Articles

Online social networks have become an essential part of daily life. However, their widespread use has a dark side. Many attackers have succeeded in cloning celebrities’ profiles and have attracted hundreds or thousands of followers, which has caused many problems for famous people. This phenomenon is commonly known as the Identity Cloning Attack, abbreviated to ICA in the literature. An ICA occurs when a malicious user selects a famous user of a social network as a victim, creates a user account similar to the victim’s profile, and embarks on various malicious social activities. In this paper, we propose an automatic method to identify cloned profiles. The method consists of three main steps and is implemented on the Hadoop framework using the MapReduce programming model. In the first step, we count the number of followers of each user and store it as an attribute of that user’s profile. In the second step, the network users are clustered based on their profile attributes and their number of followers. All profiles that fall within the same cluster as the victim’s profile are treated as suspicious and passed to the next step. Here, the victim’s profile is the celebrity profile whose authenticity the method is meant to verify. In the third step, we select the profile with the highest rank as the valid profile. This ranking is based on the outcome of the PageRank algorithm in the first step. The method is easily applicable and does not require any additional information to identify the original user account. Furthermore, it employs a distributed processing framework, limits the search space, and reduces the required computation by clustering the profiles. We applied the method to a dataset that we collected from Instagram. Our findings were quite promising, and in some situations we were able to identify all the cloned profiles with 100% accuracy. The results are comparable to the best ones in this area of study.
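
The first and third steps described in the abstract above can be illustrated with a small, single-process sketch of the map/shuffle/reduce pattern. This is not the authors' Hadoop implementation: the function names, the `(follower, followee)` edge format, and the sample data are all assumptions made for illustration, and the raw follower count stands in for the rank because the abstract does not give the details of the PageRank computation. The clustering step is skipped, and the list of suspicious profiles is simply given.

```python
from collections import defaultdict

def map_follow_edge(edge):
    """Map phase: emit (followee, 1) for each hypothetical (follower, followee) edge."""
    _follower, followee = edge
    yield followee, 1

def reduce_follower_count(user, counts):
    """Reduce phase: sum the 1s emitted for one user."""
    return user, sum(counts)

def run_mapreduce(records, mapper, reducer):
    """Tiny in-memory stand-in for the shuffle that Hadoop performs between phases."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in groups.items())

def pick_most_followed(suspicious, follower_counts):
    """Select the profile with the highest score among the suspicious ones.

    Assumption: follower count is used as the score here; the paper ranks
    profiles with PageRank, which this sketch does not implement.
    """
    return max(suspicious, key=lambda user: follower_counts.get(user, 0))

# Made-up follow edges: three accounts follow "star", one follows "star_clone".
edges = [
    ("a", "star"), ("b", "star"), ("c", "star"),
    ("a", "star_clone"), ("b", "c"),
]
follower_counts = run_mapreduce(edges, map_follow_edge, reduce_follower_count)
likely_original = pick_most_followed(["star", "star_clone"], follower_counts)
```

On a real cluster the mapper and reducer would run as separate distributed tasks over the edge list; only the per-record logic carries over from this sketch.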

Socially important locations are places that social media users visit frequently over their social media lifetime. Discovering them provides valuable information, such as which locations a social media user visits frequently, which locations are common to a group of social media users, and which locations are socially important for a group of urban area residents. However, discovering socially important locations is challenging because of the huge volume, velocity, and variety of social media datasets, the inefficiency of current interest measures and algorithms on social media big datasets, and the need for massive spatial and temporal calculations in spatial social media analyses. Cloud computing, in contrast, provides infrastructure and platforms to scale compute-intensive jobs. In the literature, only a limited number of studies on socially important locations discovery take cloud computing systems into account to scale with increasing dataset size and to handle massive calculations. This study proposes a cloud-based socially important locations discovery algorithm, Cloud SS-ILM, to handle the volume and variety of social media big datasets. In particular, we used the Apache Hadoop framework and the Hadoop MapReduce programming model to scale with dataset size and to handle massive spatial and temporal calculations. The performance of the proposed algorithm is evaluated in a cloud computing environment using a Turkey Twitter social media big dataset. The experimental results show that using cloud computing systems for socially important locations discovery provides much faster discovery of results than classical algorithms. Moreover, the results show that cloud computing systems are necessary for analyzing social media big datasets that cannot be handled with traditional stand-alone computer systems. The proposed Cloud SS-ILM algorithm could be applied in many application areas, such as targeted advertising for businesses, social media utilization of cities for city planners and local governments, and handling emergency situations.
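
To make the idea of counting frequently visited locations concrete, here is a minimal single-process sketch in the map/reduce style. It is not the Cloud SS-ILM algorithm: the abstract does not define its interest measures, so a plain visit-count threshold is used instead. The record layout `(user, lat, lon)`, the grid cell size, the threshold, and all function names are assumptions for illustration.

```python
import math
from collections import Counter

def map_checkin(record, cell_size=0.01):
    """Map phase: turn a hypothetical (user, lat, lon) record into ((user, cell), 1).

    The cell is a coarse grid square; cell_size is in degrees and is an
    assumed parameter, not one taken from the paper.
    """
    user, lat, lon = record
    cell = (math.floor(lat / cell_size), math.floor(lon / cell_size))
    return (user, cell), 1

def frequent_cells(records, min_visits=2, cell_size=0.01):
    """Reduce phase: sum visits per (user, cell) and keep those at or above min_visits."""
    counts = Counter()
    for record in records:
        key, one = map_checkin(record, cell_size)
        counts[key] += one
    return {key: n for key, n in counts.items() if n >= min_visits}

# Made-up geotagged posts: user u1 posts twice from the same grid cell.
records = [
    ("u1", 41.0082, 28.9784),
    ("u1", 41.0085, 28.9789),
    ("u1", 39.9334, 32.8597),
    ("u2", 41.0083, 28.9781),
]
important = frequent_cells(records)
```

Because the key is `(user, cell)`, the same pattern could be adapted to group-level questions by keying on the cell alone and counting distinct users, which is closer to the group-oriented use cases the abstract mentions.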

Articles published on MapReduce Programming Model

An efficient algorithm for identifying (ℓ, d) motif from huge DNA datasets

Automatic ICA detection in online social networks with PageRank

Job failure prediction in Hadoop based on log file analysis

Scalable Mining of Contextual Outliers Using Relevant Subspace

Cloud Computing-Based Socially Important Locations Discovery on Social Media Big Datasets

Distributed Subtrajectory Join on Massive Datasets

Design and implementation of ITS Architecture based on Big Data

STDADS: An Efficient Slow Task Detection Algorithm for Deadline Schedulers.

Key Technologies of Intelligent Transportation Based on Image Recognition and Optimization Control

Parallel Implementation of Improved K-Means Based on a Cloud Platform

Mobile Agent Based Distributed Network Architecture with Map Reduce Programming Model

Analysis of KNN Algorithm with Mapreduce Technique on Big Data

MR-AntMiner: A Novel MapReduce Classification Rule Discovery with Ant Colony Intelligence

A non-group parallel frequent pattern mining algorithm based on conditional patterns

Comprehend the Performance of MapReduce Programming model for K-Means algorithm on Hadoop Cluster

A multilingual fuzzy approach for classifying Twitter data using fuzzy logic and semantic similarity

An Improved Parallelization of K-means Algorithm based on HADOOP

A Survey on Implementation of Word-Count with Map Reduce Programming Oriented Model Using Hadoop Framework

Extraction of MapReduce-based features from spectrograms for audio-based surveillance

Hybrid Parallel Linguistic Fuzzy Rules with Canopy MapReduce for Big Data Classification in Cloud
