Social Spam Detection Research Articles

Online Social Networks (OSNs) make membership easy, which leads to a huge registered population and a large volume of generated information. These characteristics attract spammers, whose spam may cause annoyance, financial loss, or loss of personal information for users, and also damages the reputation of social network sites. Most spam detection methods apply machine learning techniques to user-based and content-based features. However, these annotated features are difficult to extract in real time because of the privacy policies of most social network sites. Even for the features that can be extracted, their large size makes manual extraction complex and time-consuming. There is therefore a need for text-level spam detection that does not require extracting such features. Existing text classification methods based on deep learning or on a single attention mechanism do not perform well because social network data are sparse, short, and noisy. Moreover, spammers avoid direct spam words and use indirect words to evade spam filtering techniques, which makes social network spam text dynamic and non-stationary. These indirect words carry hidden context that creates an attention drift problem. To avoid attention drift, this deep learning-based text-level spam detection technique (TextSpamDetector) proposes a conjoint attention mechanism together with two attention mechanisms, namely normal attention and context-preserving attention. One attention mechanism finds the important words, while the other focuses attention on the target context by referring to a higher-level abstraction of the context vector. These attention mechanisms refer to different context representations of the input text to find informative words in the structural context representation. This structural context representation, which contains both local semantic features and global semantic dependency features, is generated by a CNN and a BiLSTM. The proposed model is compared with existing spam detection techniques on three datasets, and the experimental results show that it performs well in terms of accuracy, F-measure, and false-positive rate.
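
The idea of an attention mechanism that "finds the important words" can be illustrated with a minimal sketch of generic dot-product attention pooling. This is not the conjoint attention of TextSpamDetector: the function names, the two-dimensional token vectors, and the context vector are all hypothetical, and in the paper the token representations would come from the CNN and BiLSTM rather than being written by hand.

```python
import math

def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(token_vectors, context_vector):
    """Score each token against a context vector, then average by weight.

    Generic dot-product attention, used here only as an illustration.
    """
    scores = [
        sum(t * c for t, c in zip(vec, context_vector))
        for vec in token_vectors
    ]
    weights = softmax(scores)
    dim = len(context_vector)
    pooled = [
        sum(w * vec[i] for w, vec in zip(weights, token_vectors))
        for i in range(dim)
    ]
    return weights, pooled

# Hypothetical toy input: three tokens with made-up 2-d representations.
tokens = ["claim", "your", "prize"]
vectors = [[0.9, 0.1], [0.1, 0.2], [0.8, 0.3]]
context = [1.0, 0.0]  # assumed context vector, not a learned one

weights, pooled = attention_pool(vectors, context)
for token, weight in zip(tokens, weights):
    print(f"{token}: {weight:.3f}")
```

With these made-up numbers, "claim" and "prize" receive more weight than "your" because their vectors align more closely with the assumed context vector. A model in the spirit of the abstract would learn the context vector and combine two such weightings computed over different context representations.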

Social media such as Facebook, MySpace, and Twitter have become increasingly important, attracting millions of users. Consequently, spammers are increasingly using such networks to propagate spam. Although existing filtering techniques such as collaborative filters and behavioral analysis filters can significantly reduce spam, each social network needs to build its own independent spam filter and support a spam team to keep spam prevention techniques current. To alleviate these problems, we propose a framework for spam analytics and detection that can be used across all social network sites. Specifically, the proposed framework, SPADE, has numerous benefits: (1) new spam detected on one social network can quickly be identified across social networks; (2) the accuracy of spam detection is improved through cross-domain classification and associative classification; (3) other techniques (such as blacklists and message shingling) can be integrated and centralized; (4) new social networks can plug into the system easily, preventing spam at an early stage. In SPADE, we present a uniform schema model to allow cross-social network integration. In this paper, we define the user, message, and web page models. Moreover, we provide an experimental study of real datasets from social networks to demonstrate the flexibility and feasibility of our framework. We extensively evaluated two major classification approaches in SPADE: cross-domain classification and associative classification. In cross-domain classification, SPADE achieved over 0.92 F-measure and over 91% detection accuracy on the web page model using a Naive Bayes classifier. In associative classification, SPADE achieved 0.89 F-measure on the message model and 0.87 F-measure on the user profile model. Both detection accuracies are above 85%. Based on these results, SPADE has been demonstrated to be a competitive spam detection solution for social media.
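
Cross-domain classification over a uniform schema can be sketched as follows. This is only an illustration under stated assumptions, not SPADE's implementation: the `Message` record, the network names, and the toy messages are hypothetical, and the classifier is a plain multinomial Naive Bayes with add-one smoothing that is trained on one network and scored on another.

```python
import math
from collections import Counter
from dataclasses import dataclass

@dataclass
class Message:
    """Hypothetical uniform message record shared by all networks."""
    network: str
    text: str
    is_spam: bool

def train_naive_bayes(messages):
    """Count words per class; returns (word_counts, class_counts, vocab)."""
    word_counts = {True: Counter(), False: Counter()}
    class_counts = Counter()
    for msg in messages:
        class_counts[msg.is_spam] += 1
        word_counts[msg.is_spam].update(msg.text.lower().split())
    vocab = set(word_counts[True]) | set(word_counts[False])
    return word_counts, class_counts, vocab

def predict(model, text):
    """Return True for spam, using add-one smoothing; unseen words are skipped."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in (True, False):
        score = math.log(class_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in text.lower().split():
            if word in vocab:
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def f_measure(predicted, actual):
    """Harmonic mean of precision and recall for the spam class."""
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Made-up toy data: train on one network, evaluate on a different one.
source = [
    Message("network_a", "win free prize now", True),
    Message("network_a", "free gift click link", True),
    Message("network_a", "lunch at noon tomorrow", False),
    Message("network_a", "see you at the meeting", False),
]
target = [
    Message("network_b", "click for free prize", True),
    Message("network_b", "meeting moved to noon", False),
]

model = train_naive_bayes(source)
predictions = [predict(model, msg.text) for msg in target]
score = f_measure(predictions, [msg.is_spam for msg in target])
print(predictions, score)
```

Because both networks are mapped to the same record type, the classifier trained on the source network can be applied to the target network without any per-network feature code, which is the property the uniform schema is meant to provide. The toy score says nothing about the figures reported in the abstract.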

Articles published on Social Spam Detection

Hybrid ensemble framework with self-attention mechanism for social spam detection on imbalanced data

Achieving Online and Scalable Information Integrity by Harnessing Social Spam Correlations

SOCIAL MEDIA SPAM DETECTION USING DIFFERENT TEXT FEATURE SELECTION TECHNIQUE AND MACHINE LEARNING

An intelligent system for multi-topic social spam detection in microblogging

Boosting Social Spam Detection via Attention Mechanisms on Twitter

Heterogeneous Ensemble with Combined Dimensionality Reduction for Social Spam Detection

TextSpamDetector: textual content based deep learning framework for social spam detection using conjoint attention mechanism

SimilCatch: Enhanced social spammers detection on Twitter using Markov Random Fields

GAMEFEST: Genetic Algorithmic Multi Evaluation measure based FEature Selection Technique for social network spam detection

Online Social Networking services and Spam Detection Approaches in Opinion Mining- A review

Mining Based Design and Analysis of Social Spam Detection in Micro-blogging

Who are the spoilers in social media marketing? Incremental learning of latent semantics for social spam detection

Recent developments in social spam detection and combating techniques: A survey

SPADE: a social-spam analytics and detection framework

Virtual Celebrator Machine

LSSVM-Based Social Spam Detection Model
